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Abstract 

Recently, quantitative versions of the Gibbard-Satterthwaite theorem were proven for k — 3 alterna- 
tives by Friedgut, Kalai, Keller and Nisan and for neutral functions on fc > 4 alternatives by Isaksson, 
Kindler and Mossel. 

We prove a quantitative version of the Gibbard-Satterthwaite theorem for general social choice func- 
tions for any number fc > 3 of alternatives. In particular we show that for a social choice function / on 
k > 3 alternatives and n voters, which is e-far from the family of nonmanipulable functions, a uniformly 
chosen voter profile is manipulable with probability at least inverse polynomial in n, k, and . 

Removing the neutrality assumption of previous theorems is important for multiple reasons. For one, 
it is known that there is a conflict between anonymity and neutrality, and since most common voting rules 
are anonymous, they cannot always be neutral. Second, virtual elections are used in many applications in 
artificial intelligence, where there are often restrictions on the outcome of the election, and so neutrality 
is not a natural assumption in these situations. 

Ours is a unified proof which in particular covers all previous cases established before. The proof 
crucially uses reverse hypercontractivity in addition to several ideas from the two previous proofs. Much 
of the work is devoted to understanding functions of a single voter, and in particular we also prove a 
quantitative Gibbard-Satterthwaite theorem for one voter. 

1 Introduction 

One of the main goals in social choice theory is to come up with "good" voting systems, which satisfy a few 
natural requirements. This problem is increasingly relevant in the area of artificial intelligence and computer 
science as well, where virtual elections are now an established tool in preference aggregation (see the survey 
by Faliszewski and Procaccia [5]). Many of the results in the study of social choice are negative: it is 
impossible to design a voting system that satisfies a few desired properties all at once. The first realization 
of an apparent problem is due to Condorcet, who, at the end of the 18"^ century, noticed the following 
paradox: when ranking three candidates, a, b, and c, it may happen that a majority of voters prefer a over 6, 
a majority prefers b over c, and a majority prefers c over a, thus producing an "irrational" circular ranking 
of the candidates. Arrow's impossibility theorem [1] [2] showed that this paradox holds under very natural 
assumptions, thus marking the basis of modern social choice theory. 

A naturally desirable property of a voting system is strategyproofness (a.k.a. nonmanipulability) : no 
voter should benefit from voting strategically, i.e. voting not according to her true preferences. However, 
Gibbard TT] and Satterthwaite showed that no reasonable voting system can be strategyproof. Before 
stating their result, let us specify the problem more formally. 

We consider n voters electing a winner among k alternatives. The voters specify their opinion by ranking 
the alternatives, and the winner is determined according to some predefined social choice function (SCF) 
f : S"^ ^ [fc] of all the voters' rankings, where Sk denotes the set of all possible total orderings of the k 
alternatives. We call a collection of rankings by the voters a ranking profile. We say that a SCF is manipulable 
if there exists a ranking profile where a voter can achieve a more desirable outcome of the election according 
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to her true preferences by voting in a way that docs not reflect her true preferences (see Definition [T] for a 
more detailed definition). 

The Gibbard-Satterthwaite theorem states that any SCF which is not a dictatorship (i.e. not a function 
of a single voter), and which allows at least three alternatives to be elected, is manipulable. This has 
contributed to the realization that it is unlikely to expect truthfulness in voting. Consequently, there have 
been many branches of research devoted to understanding the extent of manipulability of voting systems, 
and to finding ways of circumventing the negative results. 

One approach, introduced by Bartholdi, Tovey and Trick [3], suggests computational complexity as a 
barrier against manipulation: if it is computationally hard for a voter to manipulate, then she would just tell 
the truth (we refer to the survey by Faliszewski and Procaccia [5] for a detailed history of the surrounding 
literature). This is a worst-case approach, and while worst-case hardness of manipulation is a desirable 
property for a SCF to have, this does not tell us anything about typical instances of the problem — is it easy 
or hard to manipulate on average! 

A recent line of research with an average-case algorithmic approach has suggested that manipulation is 
indeed easy on average; see e.g. Kelly [T^, Conitzer and Sandholm and Procaccia and Rosenschein [3T] 
for results on certain restricted classes of SCFs (see also the survey [5]). 

A different approach was taken by Friedgut, Kalai, Keller and Nisan [TU1[3], who looked at the fraction 
of ranking profiles that are manipulable. To put it differently: assuming each voter votes independently and 
uniformly at random (known as the impartial culture assumption in the social choice literature), what is the 
probability that a ranking profile is manipulable? Is it perhaps exponentially small (in the parameters n, 
k), or is it nonnegligible? Of course, if the SCF is nonmanipulable then this probability is zero. Similarly, if 
the SCF is "close" to being nonmanipulable in some sense, then this probability can be small. We say that 
a SCF / is e-far from the family of nonmanipulable functions, if one must change the outcome of / on at 
least an e-fraction of the ranking profiles in order to transform / into a nonmanipulable function. Friedgut 
et al. conjectured that if > 3 and the SCF / is e-far from the family of nonmanipulable functions, then 
the probability of a ranking profile being manipulable is bounded from below by a polynomial in 1/n, 1/k, 
and e. Moreover, they conjectured that a random manipulation will succeed with nonnegligible probability, 
suggesting that manipulation by computational agents in this setting is easy. 

Friedgut et al. proved their conjecture in the case of fc = 3 alternatives, showing a lower bound of 
Ce^/n in the general setting, and C'e'^/n in the case when the SCF is neutral (invariant under changes 
made to the names of the alternatives), where C, C are constants. Note that this result does not have any 
computational consequences, since when there are only /c = 3 alternatives, a computational agent may easily 
try all possible permutations of the alternatives to find a manipulation (if one exists). Several follow-up works 
have since extended this result. First, Xia and Conitzer [24] used the proof technique of Friedgut et al. to 
extend their result to a constant number of alternatives, assuming several additional technical assumptions. 
However, this still does not have any computational consequences, since the result holds only for a constant 
number of alternatives. Dobzinski and Procaccia [6l proved the conjecture in the case of two voters under 
the assumption that the SCF is Pareto optimal. Finally, the latest work is due to Isaksson, Kindler and 
Mossel [15], who proved the conjecture in the case of fc > 4 alternatives with only the added assumption of 
neutrality. Moreover, they showed that a random manipulation which replaces four adjacent alternatives in 
the preference order of the manipulating voter by a random permutation of them succeeds with nonnegligible 
probability. Since this result is valid for any number of (fc > 4) alternatives, it does have computational 
consequences, implying that for neutral SCFs, manipulation by computational agents is easy on average. 

In this paper we remove the neutrality condition and resolve the conjecture of Friedgut et al.: if fc > 3 
and the SCF / is e-far from the family of nonmanipulable functions, then the probability of a ranking profile 
being manipulable is bounded from below by a polynomial in 1/n, 1/fc, and e. We continue by first presenting 
our results, then discussing their implications, and finally we conclude this section by commenting on the 
techniques used in the proof. 

1.1 Basic setup 

Recall that our basic setup consists of n voters electing a winner among k alternatives via a SCF j : S"^ ^ [fc]. 
We now define manipulability in more detail: 
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Definition 1 (Manipulation points). Let a £ S"^ he a ranking profile. Write a > b to denote that alternative 
a is preferred over b by voter i. A SCF f : ^ [k] is manipulable at the ranking profile a Cz if there 
exists a a' G and an i G [n] such that a and a' only differ in the i"* coordinate and 

/(^') > /(^)- 

In this case we also say that a is a manipulation point of f , and that {a, a') is a manipulation pair for f . We 
say that f is manipulable if it is manipulable at some point a. We also say that a is an r-manipulation point 
of f if f has a manipulation pair (cr, cr') such that a' is obtained from a by permuting (at most) r adjacent 
alternatives in one of the coordinates of a. (We allow r > k — any manipulation point is an r-manipulation 
point for r > k.) 

Let M (/) denote the set of manipulation points of the SCF f , and for a given r, let Mr (/) denote the 
set of r-manipulation points of f. When the SCF is obvious from the context, we write simply M and Mr- 

Gibbard and Satterthwaite proved the following theorem. 

Theorem 1.1 (Gibbard-Satterthwaite [UlIS^). Any SCF f: S]^ [k] which takes at least three values and 
is not a dictator (i.e. not a function of only one voter) is manipulable. 

This theorem is tight in the sense that monotone SCFs which are dictators or only have two possible 
outcomes are indeed nonmanipulable (a function is non-monotone, and clearly manipulable, if for some 
ranking profile a voter can change the outcome from, say, a to & by moving a ahead of b in her preference). 
It is useful to introduce a refined notion of a dictator before defining the set of nonmanipulable SCFs. 

Definition 2 (Dictator on a subset). For a subset of alternatives H (- [k], let top^ be the SCF on one voter 
whose output is always the top ranked alternative among those in H . 

Definition 3 (Nonmanipulable SCFs). We denote by NONMANIP = NONMANIP (n, fc) the set of non- 
manipulable SCFs, which is the following: 

NONMANIP (n, k) = {f : S''^ ^ [fc] | / (a) = top^^ (ct,) for some i £ [n] , H C [k] , H ^ 

\^ {f : SJ^ ^ [k] \ f is a monotone function taking on exactly two values} . 

When the parameters n and k are obvious from the context, we omit them. 

Another important class of functions, which is larger than NONMANIP, but which has a simpler de- 
scription, is the following. 

Definition 4. Define, for parameters n and k that remain implicit (when used the parameters will be obvious 
from the context): 

NONMANIP — {f : ^ [k] \ f only depends on one coordinate or takes at most two values}. 

The notation should be thought of as "closure" rather than "complement" . We remark that in [TS] the 
set NONMANIP is denoted by NONMANIP— but these two sets of functions should not be confused. 

As discussed previously, our goal is to study manipulability from a quantitative viewpoint, and in order 
to do so we need to define the distance between SCFs. 

Definition 5 (Distance between SCFs). The distance ^^{f^g) between two SCFs f,g: SJ! [k] is defined as 
the fraction of inputs on which they differ: D (/, g) —P{f (a) ^ g (cr)), where a e S"^ is uniformly selected. 
For a class G of SCFs, we write D (/, G) — miuggc D (/, g). 

The concepts of anonymity and neutrality of SCFs will be important to us, so we define them here. 

Definition 6 (Anonymity). A SCF is anonymous if it is invariant under changes made to the names of 
the voters. More precisely, a SCF / : S*^* — > [k] is anonymous if for every a — (cti, . . . ,(T„) G S"^ and every 

TT e Sn, 

/ (ctI, . . . , Cr„) = / (cr7r(l), • ■ • ,Cr7r(n)) ■ 

Definition 7 (Neutrality). A SCF is neutral if it is invariant under changes made to the names of the 
alternatives. More precisely, a SCF f : SJ^ ^ [k] is neutral if for every a — (cti, . . . , cr„) € S*^ and every 

TT G Sk, 

/ (tT O CTi, . . . , TT O (T„) = TT (/ (d)) . 
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1.2 Our main result 

Our main result, which resolves the conjecture of Friedgut et al. [TUl [5]. is the following. 

Theorem 1.2. Suppose we have n > 1 voters, fc > 3 alternatives, and a SCF f : SJ^ [k] satisfying 
D(/,NONMANIP) > £. Then 



for some polynomial p, where a € SJ^ is selected uniformly. In particular, we show a lower bound of 



for some polynomial q, where a G S"^ is uniformly selected, and a' is obtained from a by uniformly selecting 
a coordinate i £ {I, . . . ,n}, uniformly selecting j £ {1, . . . , n — 3}, and then uniformly randomly permuting 
the following four adjacent alternatives in at: Gi (j) , Gi {j + 1) ,ai {j + 2), and Ui {j + 3). In particular, the 
specific lower bound for P (cr € M4 (/)) implies that we can take g (e, i, -^j = YorrfEspTsr • 

1.3 Discussion 

Our results cover all previous cases for which a quantitative Gibbard-Satterthwaite theorem has been estab- 
lished before. In particular, the main novelty is that neutrality of the SCF is not assumed, and therefore our 
results hold for nonneutral SCFs as well, thereby solving the main open problem of Friedgut, Kalai, Keller 
and Nisan [9] , and Isaksson, Kindler and Mossel [15] . The main message of our results is that the approach 
of masking manipulation behind computational hardness cannot hide manipulations completely even in the 
nonneutral setting. 

Importance of nonneutrality. While neutrality seems like a very natural assumption, there are multiple 
reasons why removing this assumption is important: 

• Anonymity vs. neutrality. It is known that there is a conflict between anonymity and neutrality 
(recall Definitions [6] and [7|). In particular, there are some combinations of n and k when there exists 
no SCF which is both anonymous and neutral. 

Theorem 1.3. \2(A Chapter 2.4-] There exists a SCF on n voters and k alternatives which is both 
anonymous and neutral if and only if k cannot be written as the sum of (non-trivial) divisors of n. 

The difficulty comes from rules governing tie-breaking. Consider the following example: suppose 
n = k = 2, i.e. we have two voters, voter 1 and voter 2, and two alternatives, a and b. Suppose further 
(w.l.o.g.) that when voter 1 prefers a over b and voter 2 prefers b over a then the outcome is a. What 
should the outcome be when voter 1 prefers b over a and voter 2 prefers a over 6? By anonymity the 
outcome should be a for this configuration as well, but by neutrality the outcome should be b. 

Most common voting rules (plurality, Borda count, etc.) break ties in an anonymous way, and therefore 
they cannot be neutral as well (or can only be neutral for special values of n and fc). See Moulin |20l 
Chapter 2.4.] for more on anonymity and neutrality. 

• Nonneutrality in virtual elections. As mentioned before, voting manipulation is a serious issue 
in artificial intelligence and computer science as well, where virtual elections are becoming more and 
more popular as a tool in preference aggregation (see the survey by Faliszewski and Procaccia [5^). For 
example, consider web (meta-)search engines (see e.g. Dwork et al. [71), where one inputs a query and 
the possible outcomes ("alternatives") are the web pages (with the various search engines acting as 
"voters"). Here, due to various restrictions, neutrality is not a natural assumption. For example, there 
can be language-related restrictions: if one searches in English then the top-ranked webpage will also 



(1) 




An immediate consequence is that 



P {{(J, a') is a manipulation pair for f)>q[e,—,— 
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be in English; or safety-related restrictions: if one searches in child-safe mode, then the top-ranked 
webpage cannot have adult content. These restrictions imply that the appropriate aggregating function 
cannot be neutral. 

• Nonneutrality in real-life elections. Although not a common occurrence, there have been cases 
in real-life elections when a candidate is on the ballot, but is actually ineligible — she cannot win the 
election no matter what. In such a case the SCF is necessarily nonneutral. 

In a recent set of local elections in Philadelphia there were actually three such occurences ^23] : one 
of the candidates for the one open Municipal Court slot was not a lawyer, which is a prerequisite for 
someone elected to this position; another judicial candidate received a court order to leave the race; 
finally, in the race for a district seat in Philadelphia, one of the candidates had announced that he is 
abandoning his candidacy; yet all three of them remained on the respective ballots. 

A more curious story is that of the New York State Senate elections in 2010, where the name of a dead 
man appeared on the ballot (he received 828 votes) |14) . 

A quantitative Gibbard-Satterthwaite theorem for one voter. A major part of the work in proving 
Theorem 11.21 is devoted to understanding functions of a single voter, essentially proving a quantitative 
Gibbard-Satterthwaite theorem for one voter. This can be formulated as follows. 

Theorem 1.4. Suppose / : 5*^ — )■ [k] is a SCF on n — 1 voter and k > S alternatives which satisfies 
D (/, NONMANIP) > e. Then 

(2) P (a e M (/)) >¥{ae Ah (/)) > P [e, , 

3 

for some polynomial p, where a £ Sk is selected uniformly. In particular, we show a lower bound of -^^f^ia . 

We note that this is a new result, which has not been studied in the literature before. 

Dobzinski and Procaccia proved a quantitative Gibbard-Satterthwaite theorem for two voters, assum- 
ing that the SCF is Pareto optimal, i.e. if all voters rank alternative a above &, then h is not elected. The 
assumption of Pareto optimality is natural in the context of classical social choice, but it is a very strong 
assumption in the context of quantitative social choice. For one, it implies that every alternative is elected 
with probability at least 1/fc^. Second, for one voter, there exists a unique Pareto optimal SCF, while the 
number of nonmanipulable SCFs is exponential in k. The assumption also prevents applying the result of 
Dobzinski and Procaccia to SCFs obtained from a SCF on many voters when the votes of all voters but two 
arc fixed (since even if the original SCF is Pareto optimal, the restricted function may not be so). In our 
proof we often deal with such restricted SCFs (where the votes of all but one or two voters are fixed), and 
this is also what led us to our quantitative Gibbard-Satterthwaite theorem for one voter. 

On NONMANIP versus NONMANIP. The quantitative Gibbard-Satterthwaite theorems of Friedgut, 
Kalai, Keller and Nisan [10, 9 , and Isaksson, Kindler and Mossel [15 involve the distance of a SCF from 
NONMANIP. Any SCF that is not in NONMANIP is manipulable (by the Gibbard-Satterthwaite theorem), 
but as some SCFs in NONMANIP are manipulable as well, ideally a quantitative Gibbard-Satterthwaite 
theorem would involve the distance of a SCF from the set of (truly) nonmanipulable SCFs, NONMANIP. 
Theorem 1 1 . 2 1 addresses this concern, as it involves the distance of a SCF from NONMANIP. This is done via 
the following reduction theorem that implies that whenever one has a quantitative Gibbard-Satterthwaite 
theorem involving D (/, NONMANIP) , this can be turned into a quantitative Gibbard-Satterthwaite theorem 
involving D (/, NONMANIP). 

Theorem 1.5. Suppose f is a SCF on n voters and k > 3 alternatives for which D (/, NONMANIP] < a. 
Then either 

(3) D (/, NONMANIP) < lOOn^fc^a^/^ 
or 

(4) P(aeM(/))>P(aeA/3(/))>a. 
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The proof of this rcsuh also uses Theoreni ll.4[ our quantitative Gibbard-Satterthwaite theorem for one 
voter. 

A note on our quantitative bounds. The lower bounds on the probability of manipulation derived in 
Theorems II. 2[ II. 4[ and various results along the way, are not tight. Moreover, we do not believe that our 
techniques allow us to obtain tight bounds. Consequently, we did not try to optimize these bounds, but 
rather focused on the qualitative result: obtaining polynomial bounds. 

1.4 Proof techniques 

In our proof we combine ideas from both Friedgut, Kalai, Keller and Nisan [TOl [5] and Isaksson, Kindler and 
Mossel |15) , and in addition we use a reverse hypercontractivity lemma that was applied in the proof of a 
quantitative version of Arrow's theorem by Mossel [18j . (Reverse hypercontractivity was originally proved 
and discussed by Borell m, and was first applied by Mossel, O'Donnell, Regev, Steif and Sudakov [H].) Our 
techniques most closely resemble those of Isaksson et al. [TS]; here the authors used a variant of the canonical 
path method to show the existence of a large interface where three bodies touch. Our goal is also to come 
to this conclusion, but we do so via different methods. 

We first present our techniques that achieve a lower bound for the probability of manipulation that 
involves factors of (see Theorem 13.11 in Section [3]) , and then describe how a refined approach leads to a 
lower bound which has inverse polynomial dependence on k (see Theorem 17.11 in Section [7]) . 

Rankings graph and applying the original Gibbard-Satterthwaite theorem. As in Isaksson et 
al. 15 , think of the graph G = {V,E) having vertex set V = S]^, the set of all ranking profiles, and let 
(cr, a') e i? if and only if a and a' differ in exactly one coordinate. The SCF f : S"^ ^ [k] naturally partitions 
V into k subsets. Since every manipulation point must be on the boundary between two such subsets, we 
are interested in the size of such boundaries. 

For two alternatives a and 6, and voter i, denote by B^'^ the boundary between /^^ (a) and /^^ {h) 
in voter i. A lemma from Isaksson et al. [15] tells us that at least two of the boundaries are large; in the 
following assume that these are Bl'^ and B'^''^ . Now if a ranking profile cr lies on both of these boundaries, 
then applying the original Gibbard-Satterthwaite theorem to the restricted SCF on two voters where we fix 
all coordinates of a except the first two, we get that there must exist a manipulation point which agrees with 
a in all but the first two coordinates. Consequently, if we can show that the intersection of the boundaries 
B'^'^ and B'^''^ is large, then we have many manipulation points. 

Fibers and reverse hypercontractivity. In order to have more "control" over what is happening at the 
boundaries, we partition the graph further — this idea is due to Friedgut et al. [TUJ[n]. Given a ranking profile 
cr and two alternatives a and 6, cr induces a vector of preferences x""'^ (cr) G {—1, 1}" between a and h. For 
a vector z""'^ G {—1, 1}" we define the fiber with respect to preferences between a and b, denoted by F (z"'''), 
to be the set of ranking profiles for which the vector of preferences between a and b is z"-'^. We can then 
partition the vertex set V into such fibers, and work inside each fiber separately. Working inside a specific 
fiber is advantageous, because it gives us the extra knowledge of the vector of preferences between a and b. 

We distinguish two types of fibers: large and small. We say that a fiber w.r.t. preferences between a and 
b is large if almost all of the ranking profiles in this fiber lie on the boundary B^'^, and small otherwise. Now 
since the boundary B^'^ is large, either there is big mass on the large fibers w.r.t. preferences between a and 
b or big mass on the small fibers. This holds analogously for the boundary B!^''^ and fibers w.r.t. preferences 
between a and c. 

Consider the case when there is big mass on the large fibers of both B^'^ and i?2'^- Notice that for a 
ranking profile cr, being in a fiber w.r.t. preferences between a and b only depends on the vector of preferences 
between a and b, x°''^ (cr), which is a uniform bit vector. Similarly, being in a fiber w.r.t. preferences between 
a and c only depends on a;°''^(cr). Moreover, we know the exact correlation between the coordinates of 
x°''^ {a) and a;'''^(cr), and it is in exactly this setting where reverse hypercontractivity applies (see Lemma 
12.41 for a precise statement), and shows that the intersection of the large fibers of B'^"^ and -62'*^ is also large. 
Finally, by the definition of a large fiber it follows that the intersection of the boundaries B'^'^ and i?2'^ is 
large as well, and we can finish the argument using the Gibbard-Satterthwaite theorem as above. 

To deal with the case when there is big mass on the small fibers of B'^'^ we use various isoperimetric 
techniques, including the canonical path method developed for this problem by Isaksson et al. [15J. In 
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particular, we use the fact that for a small fiber for B^'^, the size of the boundary of i?^'** in the small fiber 
is comparable to the size of B'l'^ in the small fiber itself, up to polynomial factors. 

A refined geometry. Using this approach with the rankings graph above, our bound includes factors 
(see Theorem 13.11 in Section [3]) . In order to obtain inverse polynomial dependence on k (as in Theorem 17.11 
in Section [71), we use a refined approach, similar to that in Isaksson et al. jl5. . Instead of the rankings graph 
outlined above, we use an underlying graph with a different edge structure: (cr, cr') € E if and only if cr and 
cr' differ in exactly one coordinate, and in this coordinate they differ by a single adjacent transposition. In 
order to prove the refined result, we need to show that the geometric and combinatorial quantities such as 
boundaries and manipulation points are roughly the same in the refined graph as in the original rankings 
graph. In particular, this is where we need to analyze carefully functions of one voter, and ultimately prove 
a quantitative Gibbard-Satterthwaite theorem for one voter. 

A detailed outline of the proof of Theorem 13. II can be found in Section ISTTl while an overview of the changes 
in the proof to get to Theorem 17. 1[ in which the lower bound has inverse polynomial dependence on fc, is 
given in Section |4l Both of these theorems assume that D (/, NONMANIP) > e; the reduction theorem 
(Theorem ll.Sp then allows us to weaken this assumption to D (/, NONMANIP) > e (the proof of Theorem ll.51 
can be found at the end of the paper, in Section [5]). 

I. 5 Organization of the paper 

The rest of the paper is outlined as follows. We introduce necessary preliminaries (definitions and previous 
technical results) in Section [5] We then proceed by proving Theorem 13.11 in Section [31 which is weaker 
than Theorem 1 1.2 1 in two aspects: first, the condition D (/, NONMANIP) > e is replaced with the stronger 
condition D (/, NONMANIP) > e, and second, we allow factors of in our lower bounds for P (cr G M (/)). 
We continue by explaining the necessary modifications we have to make in the refined setting to get inverse 
polynomial dependence on k in Section [31 Additional preliminaries necessary for the proofs of Theorems 17. 11 

II. 41 and 11.51 are in Section [5l while the remaining sections contain the proofs of these theorems. We prove 
Theorem ll.4l in Section [HI Theorem l7.1l in Section [71 and Theorem 11.51 and Theorcm ll.2l in Section [51 Finally 
we conclude with some open problems in Section [9l 

2 Preliminaries: definitions and previous technical results 

In this section we introduce some definitions and previous technical results that we use throughout the paper. 
In particular, we introduce everything that is needed to prove our first theorem, Theorcm l3.1[ and in Section 
[5lwe present additional preliminaries needed for the rest of the theorems. 

2.1 Boundaries and influences 

For a general graph G = (V, E), and a subset of the vertices A C G, we define the edge boundary of A as 

de (A) = {(u, v)eE:ueA,v(^A}. 

We also define the boundary (or vertex boundary) of a subset of the vertices ^ C G to be the set of vertices 
in A which have a neighbor that is not in A: 

d (A) ~ {u £ A : there exists v ^ A such that (u, v) G E} . 

li u E d (A), we also say that u is on the edge boundary of A. 

As discussed in Section [L4l we can view the ranking profiles (which are elements of S*^) as vertices of 
a graph — the rankings graph — where two vertices are connected by an edge if they differ in exactly one 
coordinate. The SCF / naturally partitions the vertices of this graph into k subsets, depending on the value 
of / at a given vertex. Clearly, a manipulation point can only be on the edge boundary of such a subset, 
and so it is important to study these boundaries. In this spirit, we introduce the following definitions. 
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Definition 8 (Boundaries). For a given SCF f and a given alternative a G [k], we define 

if) = {a e 5;': / (a) = a} , 

the set of ranking profiles where the outcome of the vote is a. The edge boundary of this set is denoted by 
B°' (/) ; (/) — de (H" (/)). This boundary can be partitioned: we say that the edge boundary of H"^ (/) 
in the direction of the i*^ coordinate is 

B1{f)^{{a,<j')eB'^{f):a,^a'^}. 

The boundary B"^ {f) can be therefore written as B°' (f) = U"^]^i?f (/). We can also define the boundary 
between two alternatives a and b in the direction of the i*^ coordinate: 

Bf{f)^{{a,a')eBt{f):f{a')^b}. 

We also say that a G Bf (/) is on the boundary B°^'^ (/) if there exists cr' such that (cr, cr') G B"^'^ (/). 
Definition 9 (Influences). We define the influence of the i*^ coordinate on f as 

Inf, (/) = P (/ [a) ^ f (aW)) - P ((a, G U^iS? (/)) , 

where a is uniform on SJ^ and ct^*^ is obtained from a by rerandomizing the i*'' coordinate. Similarly, we 
define the influence of the i*'' coordinate with respect to a single alternative a G [k] or a pair of alternatives 
a,b G [k] as 

Infn/)=p(/(a)=a,/(aW) ^ a) = P ((a, a«) G (/)' 



and 

respectively. 
Clearly 



Inf: 



fif)=^{f{<y)=aj{a^'^)=b) =P((a,a(')) GSr^/) 



Inf,(/)=^Infn/)= E I^'^/)- 

a=l a,bGlk]:a^b 

Most of the time the specific SCF / will be clear from the context, in which case we omit the dependence 
on /, and write simply B"" = B"" (/), Bf = Bf (/), etc. 

2.2 Large boundaries 

The following lemma from Isaksson, Kindler and Mossel [TSl Lemma 3.1.] shows that there are some 
boundaries which are large (in the sense that they are only inverse polynomially small in n, k and e~^) — our 
task is then to find many manipulation points on these boundaries. 



Lemma 2.1. Fix k > 3 and f : [k] satisfying D(/, NONMANIP) > e. Then there exist distinct 

i, j G [n] and {a, 6}, {c, d} C [k] such that c ^ {a, 6} and 

2.3 General isoperimetric results 

Our rankings graph is the Cartesian product of n complete graphs on fc! vertices. We therefore use isoperi- 
metric results on products of graphs — see |13] for an overview. In particular, the edge-isoperimetric problem 
on the product of complete graphs was originally solved by Lindsey in 1964 fVI^ : 

Theorem 2.2 (Lindsey |17)). The edge-isoperimetric problem on Kn^ x x ••• x Kn^, a product of 
complete graphs with ni < n2 < ■ ■ ■ < Ud, has lexicographic nested solutions. 

Corollary 2.3. If A C Kk x ■ ■ ■ x Kk (n copies) and \A\ < (l - i) k'\ then \de {A)\ > \A\. 
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2.4 Fibers 

In our proof we need to partition the graph even further — this idea is due to Friedgut, Kalai, Keller, and 
Nisan [inilS]. 

Definition 10. For a ranking profile a ^ define the vector 

of preferences between a and b, where x'^'^ (cr) = 1 if a > b, i.e. voter i prefers a over b, and x^'^ (cr) = — 1 
otherwise. 

Definition 11 (Fibers). For a pair of alternatives a,b G [k] and a vector z'^'^ € 1}"; write 

F{z'''^) {a : x"'^ {a) = z"'^] . 

We call the F (z"''') fibers with respect to preferences between a and b. 

So for any pair of alternatives a, b, we can partition the ranking profiles according to its fibers: 

Sl^ U P i'"^') ■ 

2". '£{-1,1}" 

Given a SCF /, for any pair of alternatives a,b Cz [k] and i G [n], we can also partition the boundary 
-^i'^ if) according to its fibers. There are multiple, slightly different ways of doing this, but for our purposes 
the following definition is most useful. Define 



(z"'^) := |(T e F {z"^'') : f (cr) = a, and there exists a' s.t. {a,a') e , 



where we omit the dependence of Bi (z^^'') on /. So Bi (z'^^'') C F (z"'^) is the set of vertices on the given 
fiber for which the outcome is a and which lies on the boundary between a and b in direction i. We call the 
sets of the form Bi (z"''') fibers for the boundary B"^'^ (again omitting the dependence on / of both sets). 
We now distinguish between small and large fibers for the boundary B^'^ . 

Definition 12 (Small and large fibers). We say that the fiber Bi (z"''') is large if 

(6) e 5^ >i-j^. 

and small otherwise. 

We denote by Lg '''^ the union of large fibers for the boundary B^'^ , i.e. 

Lg (s,"''') {cr : B, (x"^'' {a)) is a large fiber, and a e B, {x"''' (cr))} 

and similarly, we denote by Sm (^Bf'''^ the union of small fibers. 



We remark that what is important is that the fraction appearing on the right hand side of ([5]) is a 
1 

k 



polynomial of — , t: and e — the specific polynomial in this definition is the end result of the computation in 



the proof. 

Finally, for a voter i and a pair of alternatives a, & G [fc], we define 

{a : Bi (x"''' {a)) is a large fiber} 

So this means that 



(7) P((T e U^a,.B, (z"'' 



a e F"-' ) > I - 
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2.5 Boundaries of boundaries 

Finally, we also look at boundaries of boundaries. In particular, for a given vector z"^'^ of preferences between 
a and b, we can think of the fiber F (z"''') as a subgraph of the original rankings graph. When we write 
d [Bi we mean the boundary of Bi (2°''') in the subgraph of the rankings graph induced by the fiber 

F {z"'''). That is, 

d {B, (z"'^)) = {ct e (z"^'') : 3 TT e F (z"-'') \ Bi (z'^-'') s.t. a and tt differ in exactly one coordinate}. 

2.6 Reverse hypercontractivity 

We use the following lemma about reverse hypercontractivity from Mossel [TH] . 

Lemma 2.4. Let x,y E {— 1,1}" be distributed uniformly and {(a^i, j/i)}"^]^ are independent. Assume that 
E [xi) = E (j/i) — /or a/^ i and that |E < p. Let Bi, B2 C {—1, 1}" be two sets and assume that 

P(Si)>e~"', P{B2)>e'^'\ 

Then 

F{x&Bi,yeB2)>cxpi ^-—J±Ji\ . 

In particular, ifV{Bi) > e and¥{B2) > s, then 

f{x e Bi,ye B2) > e^. 

2.7 Dictators and miscellaneous definitions 

For a ranking profile a — (cti, . . . , cr„) we sometimes write cr_i for the collection of all coordinates except 
the i'^ coordinate, i.e. a — {ai,a-i). Furthermore, we sometimes distinguish two coordinates, e.g. we write 
cr = ((Ti, CTj, cr_{i_i}). 

Definition 13 (Induced SCF on one coordinate). Let fa_. denote the SCF on one voter induced by f by 
fixing all voter preferences except the i*^ one according to (J-i- I.e., 

U-A-) 

Recall Definition [2] of a dictator on a subset. 

Definition 14 (Ranking profiles giving dictators on a subset). For a coordinate i and a subset of alternatives 
HQ [k], define 

:={fT-, :/._.(•) = t0PH(-)}. 
Also, for a pair of alternatives a and b, define 

A (a, 6):= U Df. 

H:{a,b}CH,\H\>3 

3 Inverse polynomial manipulability for a fixed number of alter- 
natives 

Our goal in this section is to demonstrate the proof techniques described in Section [1.41 We prove here the 
following theorem fTheorem l3.1l below). which is weaker than our main theorem, Theorem ll.21 in two aspects: 
first, the condition D (/, NONMANIP) > e is replaced with the stronger condition D (/, NONMANIP) > e, 
and second, we allow factors of in our lower bounds for P (cr € M (/)). The advantage is that the proof 
of this statement is relatively simpler. We move on to getting a lower bound with polynomial depen- 
dence on k in the following sections, and finally we replace the condition D (/, NONMANIP) > e with 
D (/, NONMANIP) > e in Section El 
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Theorem 3.1. Suppose we have n > 2 voters, fc > 3 alternatives, and a SCF f : SJ^ ~> [k] satisfying 
D (/, NONMANIP) > e. Then 

(8) P(aeAf(/))>p(^e,i 1 

for some polynomial p, where a € SJ^ is selected uniformly. In particular, we show a lower bound of 
An immediate consequence is that 

11 



P ((cr, a') is a manipulation pair for f) > q { e, , 

\ n k\ 

for some polynomial q, where a G S"^ is selected uniformly, and a' is obtained from a by uniformly selecting 
a coordinate i £ {1, . . . ,n} and resetting the i*'' coordinate to a random preference. In particular, the specific 
lower bound for P (cr € ill (/)) implies that we can take <? (e, -^i ■^) 4^8^12 (-^1)5 ■ 

First we provide an overview of the proof of Theorem 13.11 in Section 13.11 In this overview we use 
adjectives such as "big" , and "not too smaU" to describe probabihties — here these are all synonymous with 
"has probability at least an inverse polynomial of n, kl, and £~^". 



3.1 Overview of proof 

The tactic in proving Theorem 13. II is roughly the following: 

• By Lemma [231 we know that there are at least two boundaries which are big. W.l.o.g. we can assume 
that these are either B'l''' and -82 "^, or B"''' and iJj''^ with {a, b} fl {c, d} = 0. Our proof works in both 
cases, but we continue the outline of the proof assuming the former case — this is the more interesting 
case, since the latter case has been solved already by Isaksson et al. [15] . 

• We partition 5°'^ according to its fibers based on the preferences between a and b of the n voters, just 
like as described in Section [2] Similarly for B!^''^ and preferences between a and c. 

• As in Section [21 we can distinguish small and large fibers for these two boundaries. Now since _B°'^ is 
big, either the mass of small fibers, or the mass of large fibers is big. Similarly for iJj'"^. 

• Suppose first that there is big mass on large fibers in both _B°''' and B!^ '^. In this case the probability 
of our random ranking a being in F^'^ is big, and similarly for Being in F^''^ only depends on the 
vector x""'^ (a) of preferences between a and b, and similarly being in f^''^ only depends on the vector 
a;"''^ (cr) of preferences between a and c. We know the correlation between x""'^ (a) and x"'"^ (ct) and 
hence we can apply reverse hypercontractivity fLemma l2.4p . which tells us that the probability that a 
lies in both F^'^ and F^'^ is big as well. If cr G F^'^, then voter 1 is pivotal between alternatives a and 
b with big probability, and similarly if cr G F2''^, then voter 2 is pivotal between alternatives a and c 
with big probability. So now we have that the probability that both voter 1 is pivotal between a and b 
and voter 2 is pivotal between a and c is big, and in this case the Gibbard-Satterthwaite theorem tells 
us that there is a manipulation point which agrees with this ranking profile in all except for perhaps 
the first two coordinates. So there are many manipulation points. 

• Now suppose that the mass of small fibers in B^'^ is big. By isoperimetric theory, the size of the 
boundary of every small fiber is comparable (same order up to poly~^ (e"^,ri, fc!) factors) to the size 
of the small fiber. Consequently, the total size of the boundaries of small fibers is comparable to the 
total size of small fibers, which in this case has to be big. 

We then distinguish two cases: either we are on the boundary of a small fiber in the first coordinate, 
or some other coordinate. If cr is on the boundary of a small fiber in some coordinate j 7^ 1, then the 
Gibbard-Satterthwaite theorem tells us that there is a manipulation point which agrees with a in all 
coordinates except perhaps in coordinates 1 and j. If our ranking profile a is on the boundary of a 
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small fiber in the first coordinate, then either there exists a manipulation point which agrees with a 
in all coordinates except perhaps the first, or the SCF on one voter that we obtain from / by fixing 
the votes of voters 2 through n to be cr„i must be a dictator on some subset of the alternatives. So 
either we get sufficiently many manipulation points this way, or for many votes of voters 2 through n, 
the restricted SCF obtained from / by fixing these votes is a dictator on coordinate 1 on some subset 
of the alternatives. 

Finally, to deal with dictators on the first coordinate, we look at the boundary of the dictators. Since 
D (/, NONMANIP) > e, the boundary is big, and we can also show that there is a manipulation point 
near every boundary point. 

• If the mass of small fibers in B^ '^ is big, then we can do the same thing for this boundary. 



3.2 Division into cases 

For the remainder of Section [31 let us fix the number of voters n > 2, the number of alternatives fc > 3, 
and the SCF /, which satisfies D (/, NONMANIP) > e. Accordingly, we typically omit the dependence of 
various sets (e.g. boundaries between two alternatives) on /. 

Our starting point is Lemma |2. II W.l.o.g. we may assume that the two boundaries that the lemma gives 
us have i = \ and j = 2, so the lemma tells us that 



where a is uniform on the ranking profiles, and ad) is obtained by rerandomizing the first coordinate. This 
also means that 



2e 

and similar inequalities hold for the boundary 32"" . The following lemma is an immediate corollary. 
Lemma 3.2. Either 



■.d 



(9) P(ae Sm(Bi"-'')) > ^ 



fc3 



or 



(10) 

and the same can be said for the boundary B!^''^. 

We distinguish cases based upon this: either ^ holds, or ^ holds for the boundary B^''^, or ([TU)) holds 
for both boundaries. We only need one boundary for the small fiber case, and we need both boundaries only 
in the large fiber case. So in the large fiber case we must differentiate between two cases: whether d e {a, 6} 
or d ^ {a, b}. First of all, in the d ^ {a, 6} case the problem of finding a manipulation point with not too 
small (i.e. inverse polynomial in n, kl and e~^) probability has already been solved in [15' . But moreover, 
we will see that if c? ^ {a, b} then the large fiber case cannot occur — so this method of proof works as well. 

In the rest of the section we first deal with the large fiber case, and then with the small fiber case. 

3.3 Big mass on large fibers 

We now deal with the case when 



(11) 
and also 

(12) ^'^^^^M)^^- 

As mentioned before, we must differentiate between two cases: whether d G {a, b} or d ^ {a, b}. 



e 
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3.3.1 Case 1 



Suppose d e {a, 6}, in which case we may assume w.l.o.g. that d — a 
Lemma 3.3. // 

(13) P(aeLg(i?^^)) > 

then 



4j and P(aeLg(i?r))>4l' 



(14) 

Proof. By ([T^ we have that 



P (o- e Af ) > 



2n3fc9(fc!)' 



> 



and 



'(ae^^r)> 



We know that 



(15) 



E 



(a.h / \ a.c / > 
(CT)Xi^ (fJ, 



1/3, and so by reverse hypercontractivity (Lemma 12. 4p we have that 



> 



Recah that we say that a is on the boundary 5°' if there exists a' such that {a, a') G B^' . If cr G F^' , 
then with big probability cr is on the boundary i?^'*", and if G -^2*"^, then with big probabihty a is on 
the boundary B!^'"^. Using this and (jisp we can show that the probabihty of a lying on both the boundary 
B^'^ and the boundary B!^''^ is big. Then we are done, because if a lies on both B"^'^ and i?2 "^, then by the 
Gibbard-Satterthwaite theorem there is a o" which agrees with a on the last n — 2 coordinates, and which 
is a manipulation point, and there can be at most (fc!)^ ranking profiles that give the same manipulation 
point. Let us do the computation; 

P (fj on B^^\ a on B^^^^ > P (cr on B^-\ <j on a G F^'^ D F^-'^ 

> P (a G Ff^^ n - P (ct G F^^'' n F^-^, a not on B^'^^ - P (fi G n F:^^\ a not on B^' 

The first term is bounded below via (|15p . while the other two terms can be bounded using ([7]): 



a G F^'^ n F2'''^ cr not on Bl'^] < P ( cr G cr not on B^'^] <V [a not on B^'^ 



and similarly for the other term. Putting everything together gives us 



a on Bj^ ,a on B^ I > 



a G < ^ 



which by the discussion above implies ([T4|) . 
3.3.2 Case 2 

Lemma 3.4. If d ^ {a,b}, then (1111) anrf (jl2p cannot hold simultaneously. 
Proof. Suppose on the contrary that PT]) and IT^ do both hold. Then 



□ 



fxG Ff^" > 



and 



as before. Since {a, b} D {c, d} = 0, {a G Ff''} and {a G F^"''} are independent events, and so 



(aGF- 



a G F^^'') P (a G F:^A > 



13 



In the same way as before, by the definition of large fibers this implies that 



a on a on B"/) > > 0, 



but it is clear that 

P(^cr on Bl'^c^ on Bj''^) = 0> 



since a on B^' and on B2 requires / (a) e {a, b} {c, d} = 0. So we have reached a contradiction. □ 
3.4 Big mass on small fibers 

We now deal with the case when (|9]) holds, i.e. when we have a big mass on the small fibers for the boundary 
B"' . We formalize the ideas of the outline described in Section [Ol in a series of statements. 

First, we want to formalize that the boundaries of the boundaries are big, when we are on a small fiber. 

Lemma 3.5. Fix coordinate 1 and the pair of alternatives a, b. Let z°-'^ he such that Bi {z""'^) is a small 
fiber for B^'^ . Then, writing B = Bi we have 

and 

(16) ^("^^(^))^2;ifc!^^^^^)' 

where both the edge boundary de (B) and the boundary d (B) are with respect to the induced subgraph F (z"''') , 
which is isomorphic to Kk\/27 Cartesian product of n complete graphs of size k\/2. 

Proof. We use CoroUarv 12.31 with k replaced by fc!/2 and the set A being either B or B= := F (z"''') \ B. 
Suppose first that \B\ < (l - (fc!/2)". Then \de {B)\ > \B\. Suppose now that \B\ > (l - I,) (A;!/2)". 

Since we are in the case of a small fiber, we also know that \B\ < ^1 — j:^TpT^ (A:!/2)". Consequently, we get 

\d^iB)\ = \dAB^)\>m>^\B\, 

which proves the first claim. 

A ranking profile in F (z"-'') has (fc!/2 - 1) n < nkl/2 neighbors in F (z"'''), which then implies ([T6| . □ 

Corollary 3.6. // JH]) holds, then 

Proof. Using the previous lemma and © we have 
FLe\Jd{B,{z-^'))\ =Y,P{aed{B,{z-''')))> F {a e d {B, {z-^'))) 



'':Bi(2»-f')CSm(Si"'') 

z<'.':Bi(2".'')CSm(B°''') 

- 2n^k^^kl' 

Next, we want to find manipulation points on the boundaries of boundaries. 
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Lemma 3.7. Suppose the ranking profile a is on the boundary of a fiber for B^'^ , i.e. 

aeUdiB.iz^^")). 

Then either (T_i G Di {a,h), or there exists a manipulation point a which differs from a in at most two 
coordinates, one of them being the first coordinate. 

Proof. First of all, by our assumption that a is on the boundary of a fiber for B^'^ , we know that a E Bi (z°'^) 
for some z""' , which means that there exists a ranking profile a' — {a[,(7-i) such that (c, o"') G Bl'". We 

may assume a > b and > a, or else either a or a is a manipulation point. 

Now since a G d (z"''')) we also know that there exists a ranking profile tt = {ttj, cr_j 
Bi (z"-'') for some j G [k]. We distinguish two cases: j ^ 1 and j — 1. 

Case 1: j ^ 1. What does it mean for tt = {ttj, <J-j) to be on the same fiber as cr, but for tt to not be in 
Bi (z"''')? First of all, being on the same fiber means that aj and Hj both rank a and b in the same order. 
Now vr ^ i?i (z"''') means that 

• either / (tt) ^ a; 

• or / (tt) — a and / {n'l, 7r_i) 7^ b for every t:[ G Sk. 

If / (tt) — b, then either u or tt is a manipulation point, since the order of a and b is the same in both aj 
and TTj (since a and tt are on the same fiber). 

Suppose / (tt) = c ^ {a, b}. Then we can define a SCF function on two coordinates by fixing all 
coordinates except coordinates 1 and j to agree with the respective coordinates of a — letting coordinates 1 
and j vary we get a SCF function on two coordinates which takes on at least three values (a, b, and c), and 
does not only depend on one coordinate. Now applying the Gibbard-Satterthwaite theorem we get that this 
SCF on two coordinates has a manipulation point, which means that our original SCF / has a manipulation 
point which agrees with a in all coordinates except perhaps in coordinates 1 and j. 

So the final case is that / (tt) = a and / {n'l, 7r_i) ^ 6 for every tt'^ G Sk. In particular for tt (crj, 7r_i) = 
(tTj , we have / {tt) ^ b. Now if / (tt) = a then either a' or tt is a manipulation point, since the order 
of a and b is the same in both a'j = aj and ttj. Finally, if / (tt) = c ^ {o-,b}, then we can apply the 
Gibbard-Satterthwaite theorem just like in the previous paragraph. 

Case 2: j = 1. We can again ask: what does it mean for tt = (tti, cr_i) to be on the same fiber as cr, but 
for TT to not be in Bi (2;°''')? First of all, being on the same fiber means that ai and tti both rank a and b 
in the same order (namely, as discussed at the beginning, ranking a above 6, or else we have a manipulation 
point). Now TT ^ Bi (z"''') means that 

• either / (tt) ^ a; 

• or / (tt) = a and / {tt'i, 7r_i) 7^ b for every tt[ G Sk- 

However, we know that / (cr') = 6 and that cr' is of the form cr' = (crj, cr_]^) = (cr^, 7r_i), and so the only way 
we can have tt ^ Bi (z"''') is if / (tt) ^ a. 

If / (tt) — b, then tt is a manipulation point, since a > b and / (cr) — a. 

So the remaining case is if / (tt) = c ^ {a, b}. This means that (see Definition [T3| takes on at least 
three values. Denote hy H C [k] the range of /cr_i- Now either cr_i G C Di (a, 6), or there exists a 
manipulation point a which agrees with cr in every coordinate except perhaps the first. □ 

Finally, we need to deal with dictators on the first coordinate. 
Lemma 3.8. Assume that D (/, NONMANIP) > e. We have that either 

P(cr_i G Di {a,b)) < 



An^k^'^kV 
or 

(17) P (cr G M) > 



An^k^^ ik\)^' 
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Proof. Suppose P(o--i G Di (a, 6)) > 



which is the same as 



(18) 



E 



4r7,5fci2fc! ' 



H:{a.b}CH,\H\>3 

Note that for every H C [fc] we have 

£ < D (/, NONMANIP) < P(/((t) ^ top^ (cti)) < 1 - P (£)f ) , 

and so 

(19) F{D^)<l-e. 

The main idea is that imphes that the size of the boundary of is comparable to the size of , 
and if we are on the boundary of Di , then there is a manipulation point nearby. 

So first let us establish that the size of the boundary of is comparable to the size of . This is 
done along the same lines as the proof of Lemma [ 



Notice that ^ ■5'^ ^ , where S'^ ^ should be thought of as the Cartesian product of n — 1 copies of 
the complete graph on Sk- We apply Corollarv 12.31 with k replaced by fc! and with n — I copies, and we see 
that if £ > ^, then \de )| > \D^\. If e < -if and 1 - -if < P (Df ) < 1 - e then 



\de{D^)\ = 



de D 



> 



D^y >e\D^\ 



So in any case we have \de (-Cf^) | > e Since (t_i has {n — 1) (A:! — 1) < nk\ neighbors in S'^ ""^j we 

have that 



Consequently, by we have 



H:{a,b}CH,\H\>3 



H:{a,b}CH,\H\>3 



H:{a,b}CH,\H\>3 



--P(ct_i e Zjf) > 2. 



Next, suppose (t_i G d {D^) for some H such that {a,b} C H, \H\ > 3. We want to show that then 
there is a manipulation point "close" to (T_i in some sense. To be more precise: for the manipulation point 
(T, CT-i will agree with ct_i in all except maybe one coordinate. 

If CT_i G d (D^) , then there exist j G {2, . . . , n} and a'j such that cr'_i := (cr^-, cr_{i_j}) ^ . That is, 
/ct' ^ (•) ^ top^ (•). There can be two ways that this can happen — the two cases are outlined below. Denote 
by H' C [k] the range of fa'_^- 

Case 1: H' — H. In this case we automatically know that there exists a manipulation point & such that 
(T_i = a'_i, and so (T_i agrees with a-i in all coordinates except coordinate j. 

Case 2: H' 7^ H. W.l.o.g. suppose H'\H and let ce H'\H. (The other case when H \ H' ^ 9 
works in exactly the same way.) First of all, we may assume that f^r' ^ (•) = top^, (•), because otherwise we 
have a manipulation point just like in Case 1. 

We can define a SCF on two coordinates by fixing all coordinates except coordinate 1 and j to agree with 
(T_i, and varying coordinates 1 and j. We know that the outcome takes on at least three different values, 
since (t_i S , and \H\ > 3. 

Now let us show that this SCF is not a function of the first coordinate. Let ai be a ranking which puts 
c first, and then a. Then / (cti, ct-i) — a, but / (cti, c^j^) = c, which shows that this SCF is not a function 
of the first coordinate (since a change in coordinate j can change the outcome) . 

Consequently, the Gibbard-Satterthwaite theorem tells us that this SCF on two coordinates has a ma- 
nipulation point, and therefore there exists a manipulation point a for / such that (7_i agrees with (t_i in 
all coordinates except coordinate j. 

Putting everything together yields ([T7)). □ 
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3.5 Proof of Theorem 13.11 concluded 

Proof of Theorem \3.1\ If pTI) and hold, then we are done by Lemmas 13.31 and [ 

If not, then either ^ holds, or ^ holds for the boundary B^'^; w.l.o.g. assume that © holds. 
By Corollary 13.61 we haye 

We may assume that P (ct_i G Di (a, b)) < ^^b^j^fA/., , since otherwise we are done by Lemma [3^ Conse- 
quently, we then have 

P (^a e U a {B, (z-^)) , i D, (a, 6) j > 

We can then finish our argument using Lemma 13.71 

P (d e A/) > ^-^P I (7 e I ^Di(a,6) ) > ^. □ 

n(fc!)^ I rl 7 " 4n6fci2(fc!)3 



4 An overview of the refined proof 

In order to improve on the result of Theorem 13. 11 — in particular to get rid of the factor of — we need to 
refine the methods used in the previous section. We continue the approach of Isaksson, Kindler and Mossel 
[15) . where the authors first proved a quantitative Gibbard-Satterthwaite theorem for neutral SCFs with a 
bound involving factors of p-, and then with a refined method were able to remove these factors. 

The key to the refined method is to consider the so-called refined rankings graph instead of the general 
rankings graph studied in Section [3l The vertices of this graph are again ranking profiles (elements of SJ^), 
and two vertices are connected by an edge if they differ in exactly one coordinate, and by an adjacent 
transposition in that coordinate. Again, the SCF / naturally partitions the vertices of this graph into k 
subsets, depending on the value of / at a given vertex. Clearly a 2-manipulation point can only be on 
the edge boundary of such a subset in the refined rankings graph, and so it is important to study these 
boundaries. 

One of the important steps of the proof in Section [3] is creating a configuration where we fix all but two 
coordinates, and the SCF / takes on at least three values when we vary these two coordinates — then we can 
define another SCF on two voters and k alternatives which must have a manipulation point by the Gibbard- 
Satterthwaite theorem. The advantage of the refined rankings graph is that we can create a configuration 
where we fix all but two coordinates, and in these two coordinates we also fix all but constantly many adjacent 
alternatives, and the SCF takes on at least three values when we vary these constantly many adjacent 
alternatives in the two coordinates. Then we can define another SCF on two voters and r alternatives, where 
r is a small constant, which must have a manipulation point by the Gibbard-Satterthwaite theorem. Since 
r is a constant, we only lose a constant factor in our estimates, not factors of -g. 

We state the refined result in Theorem 17.11 which we also prove in Section [T] The proof of Theorem 
17.11 follows the outline of the proof of Theorem 13.11 we know that there are at least two refined boundaries 
which are big (by Isaksson et al. [15J; we partition them according to their fibers; we distinguish small and 
large fibers; and we consider two cases: the small fiber case and the large fiber case. The ideas in both cases 
are roughly the same as in Section |3l except the proofs are more involved. There is, however, one major 
difference in the small fiber case, which is the following. 

The difficulty is dealing with the case when we are on the boundary of a small fiber in the first coordinate. 
Suppose a — (tri, C-i) is on such a boundary. We know that there are kl ranking profiles which agree with a 
in coordinates 2 through n. The difficulty comes from the fact that — in order to obtain a polynomial bound 
in k — we are only allowed to look at a polynomial number (in k) of these ranking profiles when searching 
for a manipulation point. If there is an r-manipulation point among them for some small constant r, then 
we are done. If this is not the case then a is what we call a local dictator on some subset of the alternatives 
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in coordinate 1. We say that cr is a local dictator on some subset H C [k] of the alternatives in coordinate 
1 if the alternatives in H are adjacent in cti, and permuting the alternatives in H in every possible way in 
the first coordinate, the outcome of the SCF / is always the top-ranked alternative in H. 

So instead of dealing with dictators on some subset in coordinate 1, as in Section [31 we have to deal with 
local dictators on some subset in coordinate 1. This analysis involves essentially only the first coordinate, in 
essence proving a quantitative Gibbard-Satterthwaite theorem for one voter. As discussed in Section [1.31 this 
has not been studied in the literature before, and, moreover, we were not able to utilize previous quantitative 
Gibbard-Satterthwaite theorems to solve this problem easily. Hence we separate this argument from the rest 
of the proof of Theorem 17.11 and formulate a quantitative Gibbard-Satterthwaite theorem for one voter, 
Theorem 11.41 which is proven in Section [51 This proof forms the backbone for the proof of Theorem 17.11 
which is then proven in Section [T] 

5 Refined rankings graph — introduction and preliminaries 

This section presents the definitions and results that are needed for the rest of the paper, in the proofs of 
Theorems [711 [Lil and [T31 

5.1 Transpositions, boundaries, and influences 

Definition 15 (Adjacent transpositions). Given two elements a, G [k], the adjacent transposition [a : b] 
between them is defined as follows. If a € Sk has a and b adjacent, then [a : b]a is obtained from a by 
exchanging a and b. Otherwise [a : b]a ^ a. 

We let T denote the set of all k {k — 1) /2 adjacent transpositions. 

For a G SJ^, we let [a : b]^ a denote the ranking profile obtained by applying [a : b] on the i*'' coordinate of 
a while leaving all other coordinates unchanged. 

Definition 16 (Boundaries). For a given SCF f and a given alternative a G [k], we define 

H-{f) = {<jeSi::f{<7)^a}, 

the set of ranking profiles where the outcome of the vote is a. The edge boundary of this set (with respect to 
the underlying refined rankings graph) is denoted by B""'^ (/) .• i?"'-^ (/) = de [H"" (/)). This boundary can 
be partitioned: we say that the edge boundary of H"" (/) in the direction of the i*'' coordinate is 

B^^ if)^{ia,a')eB^-^^ {f) : a, ^ a[] . 

The boundary _B" (/) can be therefore written as B"'^ (/) === Uf^i^"'^ (/). We can also define the boundary 
between two alternatives a and b in the direction of the i*'' coordinate: 

Bf'' {f) = [{a,o')GBf{f):f{a') = b]. 

Moreover, we can define the boundary between two alternatives a and b in the direction of the i"' coordinate 
with respect to the adjacent transposition z £ T: 

Bf'' if) = {i<y,<y') e B^^ (/) : a' ^ z.a,f{a') = b] . 

We also say that a is on the boundary b!^'^'^ (/) if (cr, z^ct) G B^'^'^ (/). Clearly we have 

Bt''''''^{f)=\jBf'^{f). 

Definition 17 (Influences). Civen z E T, we define 

Inff^^(/)=p(/(a)=a,/(aW) =fo) 
Infr(/)=P(/(a)=a,/(a«) ^a) 
lnif'^{f) = Y,h.if'^f), 
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where a is uniformly distributed in S"^ and cr^'^ is obtained from a by rerandomizing the i*"^ coordinate CTt in 
the following way: with probability 1/2 we keep it as Ui, and otherwise we replace it by zui. 

Note that for a /b, 



Bf'^if) 



Again, most of tlie time tlie specific SCF / will be clear from the context, in which case we omit the 
dependence on /. 

5.2 Manipulation points on refined boundaries 

The following two lemmas from Isaksson, Kindler and Mossel |15) identify manipulation points on (or close 
to) these refined boundaries. 

Lemma 5.1. fldl Lemma 7.1.] Fix f : ^ [k], distinct a,b G [k] and (cr, tt) G j^^-'^.t xhen either 
cr, = [a : bjTTj, or one of a and tt is a 2 -manipulation point for f . 

Lemma 5.2. 115, Lemma 7.2.] Fix f : SJ! —?' [fc] and points (J,TT,fi G S}! such that (cr, tt) G B"]"^'^ , 
(/i, tt) G Bj'^'"^ where a,b,c are distinct and i 7^ j. Then there exists a i -manipulation point ly £ SJ} for 
f such that Vk — T^k for k ^ {i,i} and Vi is equal to at or iTi except that the position of c may be shifted 
arbitrarily and Vj is equal to fij or nj except that the position of a may be shifted arbitrarily. 

5.3 Large refined boundaries 

An essential result that will be our starting point in Section [7] is the following lemma, again from Isaksson, 
Kindler and Mossel [15], which shows that there are large refined boundaries (or else we have a lot of 
2-manipulation points automatically). 



Lemma 5.3. [15i Lemma 7.3.] Fix k > 3 and f : ^ [k] satisfying D(/, NONMANIP) > e. Let a be 
uniformly selected from S"^ . Then either 

(20) P('TGM2(/))> J^, 

or there exist distinct i,j G [n] and {a, 6}, {c, d} C [fc] such that c ^ {a, b} and 

(21) lnff''^''--''\f)>^ and lnff'^^--'\f)>^. 

nk' ■' nk' 

5.4 Fibers 

We again use fibers F (2;"''') as defined in Definition [TT] However, we need more than this. We note that the 
following definitions only apply in Section [7] i.e. when when we have at least two voters; in Section [S] when 
we have only one voter, things are simpler. 

Given the result of Lemma |5.3[ our primary interest is in the boundary ^"'''■['^ ''l , Por ranking profiles on 
this boundary, we know that the alternatives a and b are adjacent in coordinate i — so we know more than 
just the preference between a and b in coordinate i. Consequently we would like to divide the set of ranking 
profiles with a and b adjacent in coordinate i according to the preferences between a and b in all coordinates 
except coordinate i. The following definitions make this precise. 

As done in Section 12.71 for ranking profiles, we can write x"^'^ = x"',^ (cr) for the vector of preferences 
between a and b for all coordinates except coordinate i, i.e. the whole vector of preferences between a and b 

is x^^'ia)=[xfia),x^_:':i^))- 

We can define F [z^^j analogously to F (z"'^): 



(a,b\ \ a.b / \ a.b 1 
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We also define the subset of F ^ wliere a and b are adjacent in coordinate i, witli a above b: 

F (^z°ji^ '■= ^cr G F (^z"^^^ : a and b are adjacent in coordinate i, witlr a above 6| . 

Given a SCF /, for any pair of alternatives a,b £ [k] and coordinate i € [n], we can also partition the 
boundary iJ^"' (/) according to its fibers. There are multiple, slightly different ways of doing this, but for 
our purposes the following definition is most useful. 

Define 

(z!f ) ■.= [<j£F (z^f ) : / (a) = a, / {[a : b]^ a) = b} , 

where we omit the dependence of Bi (^z°^f^ on /. We call sets of the form Bi {^z^^f^ C F {^z^^f^ fibers for 
the boundary ]^'^'^'^°"^\ 

We now distinguish between small and large fibers for the boundary ]^°-'^'^°-^\ 



Definition 18 (Small and large fibers). We say that the fiber Bi (^z"',- j P j large if 

where 7 — jQ^^TpT; o.'rid small otherwise. 

As before, we denote by Lg ^^^'''^■[" ''l^ ^/jg union of large fibers for the boundary ^"''''['^■^1^ j 

Lg(i?f''^[-''l) := y i?,(z^f), 

Bii^z^^^ is a large fiber 

and similarly, we denote by Sm ^i?^'''''''" ''^^ the union of small fibers. 

As in Definition 1121 we remark that what is important is that 7 is a polynomial of and e — the 

specific polynomial in this definition is the end result of the computation in the proof. 

The following definition is used in Section [7.31 in dealing with the large fiber case in the refined setting. 

Definition 19. For a coordinate i and a pair of alternatives a and b, define F^'^ to be the set of ranking 
profiles a such that x"''' (cr) satisfies 

P (/ (a) = top{,,,j [di) \deF (a))) > 1 - 2fc7. 
Clearly F"''' is the union of fibers of the form F (z"'''), and also F (^{l,x°2f^^ C F"''' if and only if 

f((-i,x-''))cf-\ 

5.5 Boundaries of boundaries 

In the refined graph setting, just like in the general rankings graph setting, we also look at boundaries of 
boundaries. 

For a given vector z"']" of preferences between a and 5, we can think oi F { z"^^ ) as a subgraph of the 
original refined rankings graph 5^, i.e. two ranking profiles in F (^z'^^^ are adjacent if they differ by one 

adjacent transposition in exactly one coordinate. Since both of the ranking profiles are in F ^z° ^''^ , this 

adjacent transposition keeps the order of a and b in all coordinates, and moreover it keeps a and b adjacent 
in coordinate i. 
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We choose to slightly modify this graph: the vertex set is still F (^2°^^^ , but we modify the edge set by 



c 
d 
a 
b 

Define in this way a' — and a" — (cr-', cr-i), and add (cr, a') and (cr, a") to the edge set. So basically, 

we consider the block of a and b in coordinate i as a single element, and connect two ranking profiles in 
F {^z'^f^ if they differ in an adjacent transposition in a single coordinate, allowing this transposition to move 

the block of a and b in coordinate i. We call this graph G [z°2f^ — (^F (^2°^^^ , E (-^^-f)) i where E (^^J^f ^ is 
the edge set. 

When we write de (^Bi (^z"^^^^, we mean the edge boundary of Bi ^2°'f^ in the graph G ^z^'^"^ , and 
similarly when we write d [Bi ( z"2^ ) ) , we mean the vertex boundary of Bi ( z°2^ ) in the graph G ( z"'' 
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5.6 Reverse hypercontractivity 

We again use Lemma 12.41 about reverse hypercontractivity. 



5.7 Local dictators, conditioning and miscellaneous definitions 

In the general rankings graph setting wc defined a dictator on a subset of the alternatives, but in the refined 
rankings graph setting we need to define so-called local dictators. 

Definition 20 (Local dictators). For a coordinate i and a subset of alternatives HQ [k], define LOf^ to 
be the set of ranking profiles a such that the alternatives in H form an adjacent block in Ui, and permuting 
them among themselves in any order, the outcome of the SCF f is always the top ranked alternative among 
those in H. If cr £ LOf^, then we call cr a local dictator on H in coordinate i. 
Also, for a pair of alternatives a and b, define 

LD,(a,6):= |J LDf''''^}, 

the set of local dictators on three alternatives, two of which are a and b, in coordinate i. 

In dealing with local dictators, we will condition on the top of a particular coordinate being fixed. We 
therefore introduce the following notation. 

Definition 21 (Conditioning). For any coordinate i G [n] and any vectors of alternatives we define 

Pr(-):-P(-|(^.(l),...,^.(|v|))=v), 

where |v| denotes the length of the vector v. E.g. P^""* (•) = P (• | cti (1) — a) and 

p('''''^=)=P(.|(ai(l),ai (2),ai (3))-(a,6,c)). 

We use the following notation in the proof of Theorem 11.51 

Definition 22 (Majority function). For a function f whose domain X is finite and whose range is the set 
{a,b}, define Ma,i{f) by 



Maj (/) = 



if #{xeX:f{x) = a}>#{xeX:f{x) = b}^ 
if #{xeX:f{x)^a}<#{xeX:f{x)^b} 
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6 Quantitative Gibbard-Satterthwaite theorem for one voter 



In this section we prove our quantitative Gibbard-Satterthwaite theorem for one voter, Theorem 11.41 As 
mentioned before, we present this proof before proving Theorem 17. 1[ because the proof of Theorem 17.11 
follows the lines of this proof, with slight modifications needed to deal with having n > 1 coordinates. 

For the remainder of this section, let us fix the number of voters to be 1, the number of alternatives fc > 3, 
and the SCF /, which satisfies D (/, NONMANIP) > e. Accordingly, we typically omit the dependence of 
various sets (e.g. boundaries between two alternatives) on /. 

An additional notational remark: since our SCF is on one voter only, we omit the subscripts that denote 
the coordinate we are on. E.g. we write simply Inf''' instead of Inf"''', etc. 

We present the proof in several steps. 



6.1 Large boundary between two alternatives 

The first thing we have to establish is a large boundary between two alternatives. This can be done just 
like in Lemma 15.31 except there are two small differences. On the one hand, the assumption of the lemma, 
namely that D (/, NONMANIP) > e, is weaker than that of the original lemma. On the other hand, here 
we only need one big boundary, unlike in Lemma 15. 3[ where Isaksson et al. P2] showed that there arc two 
big boundaries in two different coordinates. The following lemma formulates what we need. 

Lemma 6.1. Recall that f is a SCF on 1 voter and fc > 3 alternatives which satisfies D (/, NONMANIP) > 
£. Let a £ Sk be selected uniformly. Then either 

(22) P(^eM2)>l£ 
or there exist alternatives a,b £ [k], a ^ b such that 

(23) ij.i'^Ma-M > 

Proof. The proof is just like the proof of Lemma [5.31 First, suppose that Inf^'^'^ > || for some pair of 
alternatives a b, and transposition z [a : b]. Then by Lemma [5. 11 for any point (cr, a') € B"-'''^, at least 
one of a or a' — za is a 2-manipulation point. Then 

IM2I > = 2 • fc! • Inf"''''^ > ^fc!, 

and dividing with k\ gives (P^ . So for the remainder of the proof we may assume that lid""'^'^ < || for every 
a b and z y^ [a :b]. 

For every a G [^], D (^f,to-p^^y^ > e, so P (/ (cr) = a) < 1 — e. On the other hand, there exists an 
alternative, say a £ [k], such that P (/ (ct) = a) > ^. So for this alternative we have 

Var(l[/ (a)=a])>^. 



and consequently using 15, Corollary 6.5] and [TSl Proposition 2.3] we have 



J2 J2 Inf'^'"^'" = E I^f"^"" ^ T2 Var (1 [/ (a) = a]) > 



e 



Hence there must exist some w G T and b ^ a such that Inf°' > ||, but by our assumption we must have 
w = [a : b]. □ 

If ([22I1 holds, then we are done, so in the following we assume that ([23|l holds. 

We know that a is on B"-'"^"-'-^^ if / (cr) — a and / ([a : 6] cr) = b. We know that if 6 > a, then cr is a 
2-manipulation point, so if this happens in more than half of the cases when cr is on B°'-''-^'^-''\ then we have 

P(aeM2)>|, 
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(24) P(aeS)>H£ 



in which case we are again done. So we may assume in the following that 

2£ 

|cr : f{a)^a,f ([a : b]a) = b,a > bj 



where 



B 



6.2 Division into cases 

We again divide into two cases. 

We introduce the set F of permutations where a is directly above b: 



- j cr a' / r 1 1 

F := < a Sk ■ a > b and b > a, where cr — [a : b\(7 > 



One of the following two cases must hold. 
Case 1: Small fiber case. We have 

(25) ^{a B\a <E F) <l ^ 



Case 2: Large fiber case. We have 



(26) f{aeB\(jeF)>l-^. 

6.3 Small fiber case 

In this section we assume that p5|) holds. 

We first formalize that the boundary d (B) of B is big. (Recall the definition of d [B) from Section [5?5| 
where we first introduced the graph G on the vertex set F where two ranking profiles are adjacent if they 
differ in a single transposition, but treating the block formed by adjacent alternatives a and 6 as a single 
element.) The proof uses the canonical path method, as successfully adapted to this setting by Isaksson, 
Kindler and Mossel |15) . 

Lemma 6.2. // (|25p holds, then 

(27) ^^„ed{B))>^¥{aeB). 

Proof. As mentioned before, the proof is a canonical path argument. Let B'^ = F\B. For every (cr, ct') G 
BxB'^, we define a canonical path from cr to cr', which has to pass through at least one edge in de {B). Then if 
we show that every edge in de (B) lies on at most r canonical paths, then it follows that \de {B)\ > \B\ \B'^\ jr. 

So let [<T.,<y'^ G B X B'^. We apply the path construction of [151 Proposition 6.4.], but considering the 
block formed by a and 6 as a single element. Since this path goes from cr (which is in B) to a' (which is in 
B'^), it must pass through at least one edge in de {B). 

For a given edge (tt, tt') e de (B), at most how many possible (cr, a') G B x B'^ pairs are there such that 
the canonical path between cr and a' defined above passes through (tt, tt')? We learn from [151 Proposition 
6.4.] that there are at most (fc — 1) {k — l)!/2 < k^ {k — 1)1/2 possibilities for the pair {a, a'). 

Recall that |F| = {k - 1)!. By our assumption we have \B\ < (l - ^) (fc - 1)!, and so > ^ (fc - 1)!. 
Therefore 

Now in G every ranking profile has k — 2 < k neighbors, which implies (|?7)) . □ 
Corollary 6.3. // ^ holds, then 

(28) p(^aed{B))>^. 
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Proof. Combine Lemma [6.21 and (IMl) . □ 

Next we want to find manipulation points on the boundary d{B). The next lemma tells us that if we 
are on the boundary d (B), then either we can find manipulation points easily, or we are at a local dictator 
on three alternatives. 

Lemma 6.4. Suppose that a E d (B). Then 

• either a G LD (a, 6), 

• or there exists a € M3 such that a is equal to a or [a : b] a except that the position of a third alternative 
c might be shifted arbitrarily. 

Proof. Since a E d (B) C B, we know that / (cr) = a, and if a' = [a : b] cr, then / (cr') = b. Let tt E B'^ denote 
the ranking profile such that (cr, tt) S de (B), and let n' — [a : b] n. Since n ^ B, {f (tt) , / (tt')) ^ (a, 6). Then, 
by Lemma |5. 11 if / (tt) ^ / (tt'), then one of tt and tt' is a 2-manipulation point. So assume / (tt) = / (tt'). 

There are two cases to consider: either a and tt differ by an adjacent transposition not involving the 
block of a and b, or they differ by an adjacent transposition that moves the block of a and b. 

In the former case, it is not hard to see that one of a, a', tt, tt' is a 2-manipulation point, by Lemma |5. II 
If a and tt differ by an adjacent transposition that involves the block of a and 6, then there are again 
two cases to consider: either this transposition moves the block of a and b up in the ranking, or it moves it 
down. 

If the block of a and b is moved up to get from a to tt, then we must have / (vr) — a, or else a or tt is 
a 3-manipulation point. Then we must have / (tt') = / (vr) = a, in which case tt' is a 3-manipulation point, 
since / (cr') — b. 

The final case is when the block of a and b is moved down to get from a to tt, and a third alternative, 
call it c, is moved up, directly above the block of a and b. Now if / (vr) = d ^ {a, 6,c}, then cr or tt is a 
3-manipulation point. If / (tt) = / (tt') = a, then tt' is a 3-manipulation point, whereas if / (tt) — f (tt') — b, 
then TT is a 3-manipulation point. The remaining case is when / (tt) — f {tt') — c. Now if / ([fe : c] ct) 7^ a or 
f {[a : c] a') ^ b, then we again have a 3-manipulation point close to cr. Otherwise a £ LD (a, b). □ 

The following corollary then tells us that either we have found many 3-manipulation points, or we have 
many local dictators on three alternatives. 

Corollary 6.5. // (|25|) holds, then either 

2 

(29) IP('7eLD^"'''^">) =P(f7eLD(a,6))> ^ 

or 

£2 



2fcio 



6.3.1 Dealing with local dictators 

So the remaining case we have to deal with in this small fiber case is when holds, i.e. we have many 
local dictators on three alternatives. 

Lemma 6.6. Suppose a e LD'f'^'^^^)' fgr some alternative c ^ {a,b}. Let a' be equal to a except that the 
block of a, b and c is moved to the top of the coordinate. Then 

• either a' £ LD^"'''^^^, 

• or there exists a 3-manipulation point a which agrees with a except that the positions of a, b and c 
might be shifted arbitrarily. 
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Proof. W.l.o.g. we may assume that in a alternative a is ranked above 6, which is ranked above c. Now 
bubble up a to the top, using adjacent transpositions. If at any point during this the outcome of / is not a, 
then we have found a 2-manipulation point. Now bubble up b to right below a, and then bubble up c to be 
right below b. Again, if at any point during this the outcome of / is not a, then there is a 2-manipulation 
point. Otherwise we now have a, b and c at the top (in this order), with the outcome of / being a. Now 
permuting alternatives a, b and c at the top, we either have a 3-manipulation point, or a' 

Corollary 6.7. // (gH]) holds, then either 

(30) J2 P(^eLD^^''''^\{'T(l),^(2),a(3)} = {a,6,c}) >^ 

c^{a,b} 



' {<y e A/a) > 



Proof. Lemma 16.61 tells us that when we move the block of a, b, and c up to the top, we either encounter a 
3-manipulation point, or we get a local dictator on {a, b, c} at the top. 

If we get a 3-manipulation point, by the describtion of this manipulation point in the lemma, there can 
be at most ranking profiles that give the same manipulation point. 

If we arrive at a local dictator at the top, then there could have been at most k different places where 
the block of a, b and c could have come from. □ 

Now ([30]) is equivalent to 

2 



(31) E ^ e LD^''''''^}, {a (1) , a (2) , a (3)) = (a, b, c)) > 

cf{a,b} 



£ 



2Ak 



11 ■ 



We know that 



■((,(1), .(2),.(3))^(„,M)) 



and so (|31|) implies (recall Definition [2T|) 

c({a,b} 

Now fix an alternative c ^ {a,b} and define the graph G^a,b.c) = {V(a,b.c), E(^a,b,c)) to have vertex set 

V(a.M WeSk: {a (1) , a (2) , a (3)) = (a, b, c)} 

and for cr, tt S V(a,b,c) Ist {a, tt) £ £'(a.b,c) if ^nd only if a and tt differ by an adjacent transposition. So G(^a.b,c) 
is the subgraph of the refined rankings graph induced by the vertex set V(^a,b,c)- (If A; = 3 or fc = 4, then this 
graph consists of only one vertex, and no edges.) 
Let 

T(a,&,c) ■.= V^a,b.,)nLB^''-'''\ 

and let de (T (a, b, c)) and d (T (a, 6, c)) denote the edge- and vertex-boundary of T (a, 6, c) in G(a,b,c) ■, respec- 
tively. 

The next lemma shows that unless T (a, b, c) is almost all of V(^a,b,c)j the size of the boundary d (T (a, 6, c)) 
is comparable to the size of T (a, 6, c). The proof uses a canonical path argument, just like in Lemma [ 



Lemma 6.8. Let c ^ {a, b} be arbitrary. Write T = T {a, b, c) for simplicity. //p('*^''''=) {a eT) <1-S, then 

(33) p('''^''=) ((T e 9 (T)) > _p('^'f'^=) (fj e T) . 
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Proof. Let T'^ = V(a.6,c) \ T{a,b,c). For every (cr, ct') E T x T'^, we define a canonical path from a to cr', 
which has to pass through at least one edge in de {T). Then if we show that every edge in de (T) lies on at 
most r canonical paths, then it follows that \de {T)\ > \T\ \T'^\ jr. 

So let (cr, cr') E T X r^. We apply the path construction of [15, Proposition 6.4.], but only to alternatives 
[k]\{a,b,c}. 

The analysis of this construction is done in exactly the same way as in Lemma 16.21 in the end we get 
that there are at most (fc — 3)! paths that pass through a given edge in de (T). 

Recall that |V(„,fc^c)| = (fc - 3)! and that by our assumption |T| < (1 - 5) (fc - 3)!, so \T''\ > S {k - 3)!. 
Therefore 

ITI IT'^I S 



Now every vertex in V(^a,b.c) has fc — 4 < fc neighbors, which implies (j33p . □ 

The next lemma tells us that if cr is on the boundary of a set of local dictators on {a, 5, c} for some 
alternative c ^ {a, b}, then there is a 2-manipulation point a which is close to cr. 

Lemma 6.9. Suppose a E d (T {a,b, c)) for some c ^ {a,b}. Then there exists a E M2 which equals za for 
some adjacent transposition z that does not involve a, b or c, except that the order of the block of a, b and c 
might be rearranged. 

Proof. Let n be the ranking profile such that (cr, tt) E {T (a, &, c)), and let z be the adjacent transposition 
in which they differ, i.e. tt = za. Since tt ^ T{a,b,c), there exists a reordering of the block of a, b, and c 
at the top of tt such that the outcome of / is not the top ranked alternative. Call the resulting vector tt'. 
W.l.o.g. let us assume that tt' (1) = a. Let us also define a' := ztt'. Now tt' is a 2-manipulation point, since 
f{a') = a. □ 



The next corollary puts together Corollary 16.71 and Lemmas 16.81 and 16.91 

Corollary 6.10. Suppose pop holds. Then if for every c i. {a, 6} we have p(«>*'^=) (cr e T (a, fe, c)) < 1 - -r^, 
then 



'(cr E Ml) > 



£3 



IQSfcie- 

Proof. We know that (pO|) implies 

E IP"'^(-eT(a,5,c))>^. 

c^{a,b} 

Now using the assumptions. Lemma 16.81 with S = j^q^^ and Lemma 16.91 we have 

P(aeM2)> e ^^2) > J2 ^P^"''''"H'^^d{T{a,b,c))) 

c^{a,b} c^{a,6} 
cf{a,b} 

So again we are left with one case to deal with: if there exists an alternative c ^ {a, b} such that 
f>{a,b:c) (^cr £ T {a,b, c)) > 1 — Define a subset of alternatives K C [fc] in the following way: 

K ■.^ia,b}uUE [fc] \ la, 61 : P^"^"'"^ (a E T (a, 6, c)) > I - 



{a, 6} U |c e [fc] \ {a, 6} : p("^''''=) {a E T (a, 6, c)) 



lOOfc- 



In addition to a and b, K contains those alternatives that whenever they are at the top with a and 6, they 
form a local dictator with high probability. 
So our assumption now is that |-ftr| > 3. 

Our next step is to show that unless we have many manipulation points, for any alternative c E K, 
conditioned on c being at the top, the outcome of / is c with probability close to 1. 
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Lemma 6.11. Let c £ K . Then either 



£ 



(34) P(^)(/(fT) = c)>l 



(35) P ((7 e Ma) > ^ 



100fc4' 



Proof. First assume that c ^ {a, 6}. 

Let a be uniform according to P^'^\ i.e. uniform on S'fc conditioned on cr(l) = c. Define cr', where cr' 
is constructed from a by first bubbhng up ahernative a to just below c, using adjacent transpositions, and 
then bubbfing up b to just below a. Clearly a' is distributed according to p(^^''^''), i.e. it is uniform on Sk 
conditioned on (tr (1) , a (2) , ct (3)) = (c, a, h). 

Since c € we know that p(^.°.'') (^cr e lD^"'^'"'^^ > 1 - This also means that 



(ff' e LD^'^^''''=>) > 1 



lOOfc 

Now we can partition the ranking profiles into three parts, based on the outcome of the SCF f at a and cr': 

11 = {a:f (a) = c, f (a') = c} 

12 = W:f ^cj {a') = c} 

If P(<=) (/i) > 1 - gf^, then dMl) holds. Otherwise we have P^'^) {h U h) > and since P^'^) (/g) < we 
havePW(/2)>^. 

Now if cr e I2, then we know that there is a 2-manipulation point along the way as we go from a to 
a'. I.e. to every a d I2 there exists a G such that a is equal to cr except perhaps a and b are shifted 
arbitrarily. So there can be at most ranking profiles a giving the same 2-manipulation point tr, and so 
we have 

P(, e M2) > ip(^) (a e Ah) > ^P(^) ih) > 

showing 

Now suppose c € {a, 6}, w.l.o.g. assume c = a. We know that |_ftr| > 3 and so there exists an alternative 
d £ K \ {a, 6}. We can then do the same thing as above, but we now bubble up b and d. □ 

We now deal with alternatives that are not in K: either we have many manipulation points, or for any 
alternative d ^ K, the outcome of / is not d with probability close to 1. 

Lemma 6.12. Let d ^ K. IfP{f{a) = d) > ^, then 

P(cr G M2) > 



Proof. Let cr be such that / (cr) — d. Bubble up d to the top, and call this ranking profile cr'. Now if 
/ (cr') ^ d, then we know that there exists a 2-manipulation point a along the way, i.e. a 2-manipulation a 
which agrees with a except perhaps d is shifted arbitrarily. Consequently, either 

P(.eM.)>^, 

in which case we are done, or 

P(a:f{a)=f{a') = d)>-^. 

Next, let us bubble up a to just below d, and then bubble up b to just below d. Denote this ranking 
profile by cr('^^''''^), and analogously define cr(''^'^^''), a'"-''^''), cr^"^'*-''), cr(^'"''*\ and a^'>,d,a}^ Either we encounter a 
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2-manipulation point a along the way of bubbling up to a^'^'^'°''> {a agrees with cr except d is at the top, and 
a and b might be arbitrarily shifted), or the outcome of the SCF f is d all along. So we have that either 

P{aeM2)> ^ 



16/c3' 

in which case we are done, or 

a:f{a) = f {a') = f [a^'-'^^^^) = f (a^'^''''")) = d) > 

Now start from a'^'^''^'''\ First swap a and d to get a'-''''^'''\ then swap d and b to get a'^'^''^''''\ and finally 
bubble d and b down to their original positions in a, except for the fact that a is now at the top of the 
coordinate. Call this profile a. Since a is uniformly distributed, a is distributed according to P^'^'', i.e. 
uniformly conditional on a (1) = a. Now note that one of the following three events has to happen. (These 
events are not mutually exclusive.) 

h = {.f = / (^a^-'"-^^) = a} 

I3 — {a : 3 a £ M2 which is equal to a except a is shifted 
to the top, and b and d may be shifted arbitrarily}. 

Since a € K , we know by Lemma 16.111 that (unless we already have enough manipulation points by the 
lemma) we must have 

P(/(a)^a)=p('^) {f{a)^a)<^. 

Consequently 

- - 17£ 

= d] > 



P (/i U h, f{a)^f (a') = / (^a^'iM^ = f (^a^d,a,b)^^ = d) 

and so either 



16fc 50k 400fc' 



17e 



800fc3' 

in which case we are done, or 

17e 



SOOfc 



P (a : / = / (^('^■'^■^)) = d, f (a^'^^^^'^)) = / (a^^'^^*'")) = a) > 

Next, we can do the same thing with b on top, and we ultimately get that cither 

^('^^^^^^^lecb' 

in which case we are done, or 

(36) P(°'^''') (^(j'^'^.bA) ^ LD^"'''''^>) = P (ct : (T^'^'^^'^) e j^j^iaAdj'j > Y^^- 

Define G(^a,b,d) and T(^a,bA) analogously to G,^a,b,c) and T(^a,b,c), respectively. 

Suppose that ([55]) holds. We also know that d ^ K, so Lemma applies, and then Lemma shows 
us how to find manipulation points. We can put these arguments together, just like in the proof of Corollary 
I6.10[ to show what we need: 

P (a e M2) > l^p(^'>'^^^ {a e M2) > ^^^"'''^^ i<JGd{T (a, b, d))) 
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Putting together the results of the previous lemmas, there is only one case to be covered, which is covered 
by the following final lemma. Basically, this lemma says that unless there are enough manipulation points, 
our function is close to a dictator on the subset of alternatives K. 

Lemma 6.13. Recall that we assume that D (/, NONMANIP) > e. Furthermore assume that \K\ > 3, for 
every c ^ K we have 

(37) P(^'(/(a) = c)>l-^, 



and for every d ^ K we have 



Then 



P(/(^)=d)<^. 



(38) P(^eM2)>^. 
Proof. First note that 

P (/ {<y) ^ top^ (^)) <V{f{a)iK)+^ if (a) ^ top^ (a) Jia)eK). 

We know that 

e < D (/, NONMANIP) < P (/ (a) ^ top^ {a)) 

and also that 

which together imply that necessarily 

P(/My^top^ {<T),f{'j)eK)>^. 

Let a be such that / {a) ^ top^ {a) and / (u) G K. Now bubble top^ (cr) up to the top in ct, call this 
ranking profile g. Clearly then top^- (cr) = top^ (tr). 

There are two cases to consider. If / (a) ^ f i^), then there is a 2-manipulation point along the way 
from a to a, i.e. a 2-manipulation point a such that a agrees with a except perhaps some alternative c is 
arbitrarily shifted. Otherwise f {(j) = f and so / {a) ^ top^^ [a). 

Consequently we have that either (|38|) holds, or that 

(39) P(a:/(a)^top,^(a))>|. 

By the construction of (t, we know that a is uniformly distributed conditional on (1) e K . Consequently, 
by (|37|) . we have that 

P(a:/(a)^top^ i^)) < 

which contradicts with (l39t since < |. □ 
This concludes the proof of the small fiber case. 

6.4 Large fiber case 

In this section we assume that (|26p holds. We show that we either have a lot 2-manipulation points or we 
have a lot of local dictators on three alternatives. 
Our first step towards this is the following lemma. 



Lemma 6.14. Suppose ([25]) holds. Th 



en 



(40) p(''''')(CTeB)>l-|. 
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Proof. Let B"^ = F \ B. Our assumption (^5]) implies that P (cr € S"^ | cr € F) < which means that 
< and so 

P^'^^^) (a ^ B) < ^^^^^ < -, 



which is equivalent to (j40|) . □ 

The next lemma (together with Section Rj.S.ip concludes the proof in the large fiber case. 
Lemma 6.15. Suppose holds and recall that our SCF f satisfies D (/,NONMANIP) > e. Then either 

(41) P(<TeM2)>^ 

or 

(42) P(^eLD(a,6))>^. 

Proof. By Lemma FB.f 41 we know that (|40|) holds. 

Let a e Sk be uniform. Define a' by being the same as a except alternatives a and b are moved to the 
top of the coordinate: a' (1) = a and cr' (2) = b. Clearly cr' is distributed according to P('''^) (•). Also define 
a" = [a:b]a'. 

We partition the set of ranking profiles Sk into three parts: 

h ■■={cTeSk:f (a) = top{„,,j (a) , (/ {a') , f {a")) = (a, b)] 

h := {a e 5fc : / {a) ^ tov^^M (^) ' (/ (^') ' / (^")) = («' ^)} 
/3:-{'Te5fe:(/(a'),/(a"))^(a,6)}. 

By (go]) we know that P (cr G /s) < |. We also know that P (d G /i) < since D (/, NONMANIP) > e. 
Therefore we must have 

P(cTe/2)>|>|. 

Let us partition I2 further, and write it as /2 = /2 U (Uc^{a,6}^2,c) , where 

I2 -{^^h-f (t) ^ top{,,,j (a) , / (a) e {a, 6}} 

and for any c ^ {a, 6}, 

/2,c := {'T e /2 : / (a) = c} . 

Suppose cr e W.l.o.g. let us assume that a is ranked higher than 6 by cr, and therefore / (cr) = 6, 
since a £ li^. Then we can get from cr to cr' by first bubbling up a to the top, and then bubbling up b to 
just below a. Since / (cr) = b and / (cr') = a, there must be a 2-manipulation point ct along the way, which 
is equal to cr except perhaps the positions of a and 6 are arbitrarily shifted. 

Now suppose that cr S /2,c for some c ^ {a, 6}. We distinguish two cases: either c is ranked above both 
a and b in cr, or it is not. 

If not, then say a is ranked above c in cr. Bubble a all the way to the top, and then bubble b as well, all 
the way to the top, just below a. Since / (cr) = c and / (cr') = a, there must be a 2-manipulation point a 
along the way, which is equal to a except perhaps the positions of a and b are arbitrarily shifted. 

If c is ranked above both a and b in cr, then the argument is similar. First bubble up a and 6 to just below 
c, and denote this ranking profile by ct, then permute these three alternatives arbitrarily, and then bubble 
a and b to the top. It is not hard to think through that either there is a 2-manipulation a along the way, 
which is then equal to a except perhaps the positions of a and b are arbitrarily shifted, or else ir E LD^'^'''''^' . 

Combining these cases we see that either (|41l) or (|42|) must hold. □ 

So if (|4T1) holds then we are done, and if (|42|) holds, then we refer back to Section 16.3.11 where we deal 
with the case of local dictators on three alternatives. 
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6.5 Proof of Theorem 11.41 concluded 



Proof of Theorem \l-4\ Our starting point is Lemma 16. 1[ which imphes that ([M]) holds (unless we already 
have many 2-manipulation points). We then consider two cases, as indicated in Section [6.21 

We deal with the small fiber case — when (l^5|) holds — in Section 16.31 First, Lemma 16.21 Corollary 16.31 
Lemma 16.41 and Corollary 16.51 show that either there are many 3-manipulation points, or there are many 
local dictators on three alternatives. We then deal with the case of many local dictators in Section 16.3.11 
Lemma 16.61 Corollary 16.71 Lemmas 16.81 and 16.91 Corollary 16.101 and Lemmas 16.111 16.121 and 16.131 together 
show that there are many 3-manipulation points if there are many local dictators on three alternatives, and 
the SCF is e-far from the family of nonmanipulable functions. 

We deal with the large fiber case — when I^E^ holds — in Section Here Lemma [6.151 shows that either 
we have many 2-manipulation points, or we have many local dictators on three alternatives. In this latter 
case we refer back to Section 16.3.11 to conclude the proof. □ 



7 Inverse polynomial manipulability for any number of alterna- 
tives 

In this section we prove the theorem below, which is the same as our main theorem. Theorem 11.21 except 
that the condition of D (/, NONMANIP) > e from Theorem 11.21 is replaced with the stronger condition 
D (/, NONMANIP) > e. 

Theorem 7.1. Suppose we have n > 2 voters, fc > 3 alternatives, and a SCF f : S"^ [k] satisfying 
D (/, NONMANIP) > e. Then 

(43) P (a e Af (/)) > P (a e M4 (/)) > p f £, -, ^ 

\ n k 

for some polynomial p, where a €z SJ^ is selected uniformly. In particular, we show a lower bound of ]^Q0^tVfc4ti ■ 
An immediate consequence is that 

P ((fj, a') is a manipulation pair for f)>q ( e, — , — 

for some polynomial q, where a € SJ^ is uniformly selected, and a' is obtained from a by uniformly selecting 
a coordinate i € {I, . . . ,n}, uniformly selecting j g {1, . . . , n — 3}, and then uniformly randomly permuting 
the following four adjacent alternatives in ai'. Ci (j) , ai (j -|- 1) , (7^ (j -|- 2), and ai {j -|- 3). In particular, the 
specific lower bound for P (cr G M4 (/)) implies that we can take q (e, -i, = xqtttjfpt- 

For the remainder of the section, let us fix the number of voters rt > 2, the number of alternatives fc > 3, 
and the SCF /, which satisfies D (/, NONMANIP) > e. Accordingly, we typically omit the dependence of 
various sets (e.g. boundaries between two alternatives) on /. 

7.1 Division into cases 

Our starting point in proving Theorem 17.11 is Lemma 15.31 Clearly if (|20p holds then we are done, so in the 
rest of Section [7] we assume that this is not the case. Then Lemma [5.31 tells us that ([2T]) holds, and w.l.o.g. 
we may assume that the two boundaries that the lemma gives us have i — \ and j = 2. I.e. we have 

""'^ P(.oni?, ^ 'j>-^, 

where recall that a is on B'^''''^""^^ if / (cr) = a and / ([a : h]^ a) = b. li a is on ^"'^•["•''l a^cj b > a, then cr is a 
2-manipulation point, so if this happens in more than half of the cases when a is on ^"•''•["■''l ^ then we have 



'(-^M.)>-|, 
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and we are done. Similarly in the case of the boundary between c and d in coordinate 2. So we may assume 
from now on that 

F e U^.i., (.-)) > I, and P (. e U^,.B. (.1,^)) > ^. 

The following lemma is an immediate corollary. 
Lemma 7.2. Either 

(44) p(aESm(5r-^^[-^l))>-i, 
or 

(45) K^^lK^'"'))^;!^' 
and t/ie same can be said for the boundary ^^-'''I'^ '^l _ 

We distinguish cases based upon this: either (jH]) holds, or pij) holds for the boundary ^^■'''['^■''1^ or 
holds for both boundaries. We only need one boundary for the small fiber case, and we need both 
boundaries only in the large fiber case. So in the large fiber case we must differentiate between two cases: 
whether d G {a, 6} or c? ^ {a, b}. First of all, in the d ^ {a, b} case the problem of finding a manipulation 
point with not too small (i.e. inverse polynomial in n, k and e~^) probability has already been solved by 
Isaksson, Kindler and Mossel [15], so we are primarily interested in the d G {a,b} case. But moreover, we 
will see that our method of proof works in both cases. 

In the rest of the section we first deal with the small fiber case, and then with the large fiber case. 



7.2 Small fiber case 

We now deal with the case when (pij) holds. We formalize the ideas of the outline in a series of statements. 

First, we want to formalize that the boundaries of the boundaries are big in this refined graph setting as 
well, when we are on a small fiber. The proof uses the canonical path method, as successfully adapted to 
this setting by Isaksson, Kindler and Mossel [15] , and is very similar to the proof of Lemma 16. 2| with some 
necessary modifications due to the fact that we now have n coordinates. 

Lemma 7.3. Fix a coordinate and a pair of alternatives — for simplicity we choose coordinate 1 and alter- 
natives a and b, but we note that this lemma holds in general, we do not assume anything special about 

these choices. Let be such that Bi is a small fiber for B'^^'^""^^ Then, writing B = Bi ^z^f^ for 

simplicity, we have 

(46) P(^ea(i?))>-^P(aeS). 

Proof. Let B'^ = F ^^"f^ \ B. For every {a, a') G B x B'^, we define a canonical path from a to a', which 

has to pass through at least one edge in de (B). Then if we show that every edge in de [B) lies on at most r 
canonical paths, then it follows that \de {B)\ > \B\ /r. 

So let {cr,a') € B X B'^. We define a path from a to a' by applying a path construction in each coordinate 
one by one, and then concatenating these paths: first in the first coordinate we get from cri to a'l, while 
leaving all other coordinates unchanged, then in the second coordinate we get from a2 to a 2, while leaving 
all other coordinates unchanged, and so on, finally in the last coordinate we get from (T„ to a'^^. In the first 
coordinate we apply the path construction of [T5j Proposition 6.4.], but considering the block formed by a 
and 6 as a single element; in all other coordinates we apply the path construction of |151 Proposition 6.6.]. 
Since this path goes from a (which is in B) to a' (which is in B'^), it must pass through at least one edge in 
de{B). 

For a given edge (tt, tt') € de (B), at most how many possible {a, a') E B x B'^ pairs are there such that 
the canonical path between cr and a' defined above passes through (tt, tt')? Let us differentiate two cases. 
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Suppose TT and tt' differ in tfie first coordinate. Tlien coordinates 2 tlirougli 71 of ct must agree with 
the respective coordinates of tt, while coordinates 2 through n of a' can be anything (up to the restriction 

given hy a' € B'^ C F giving (-1^)" ^ possibilities. Now fixing all coordinates except the first, 

Proposition 6.4.] tells us that there are at most (fc — 1)^ (fc — < k"^ {k — 1)! possibilities for the pair 

{(Ti^a'i). So altogether there are at most fc^ (fc — 1)! (3-)" paths that pass through a given edge in de (B) 
in this case. 

Suppose now that tt and tt' differ in the i^^ coordinate, i ^ 1. Then the first i — 1 coordinates of a' must 
agree with the first i ~ 1 coordinates of tt, while coordinates i + 1, ri of must agree with the respective 
coordinates of tt. The first i — 1 coordinates of cr, and coordinates i + 1, . . . , n of cr' can be anything (up 

to the restriction given by a,a' € F giving {k — 1)1 (^)" ^ possibilities. Now fixing all coordinates 

except the i*-^ coordinate, [TSl Proposition 6.6.] tells us that there are at most fc'^fc! possibilities for the pair 
{ai,af). So altogether there are at most 2k^ (k — 1)! (^)" ^ paths that pass through a given edge in (B) 
in this case. 

So in any case, there are at most 2k^ (fc — 1)! (^) paths that pass through a given edge in {B). 



Recall that 
Therefore 



F (z°f ) = (fc - 1)! (#)" \ and also JB^J > 7 (fc - 1)! (|)" ^ since S is a small fiber. 



' ^ 2fc4(fc-l)!(f)" " 



Now in G (^z'^i^ every ranking profile has no more than nk neighbors, which implies (1461) . □ 
Corollary 7.4. // gl]) holds, then 



\ 



2n2fci 



Proof. Using the previous lemma and (|44p we have 



> J2 FUedlB, (z^t 



,b:[a:!,]A 



^l'?:Bi(.^f)CSm(i3r''^'"^''0 



Next, we want to find manipulation points on the boundaries of boundaries. 

Before we do this, let us divide the boundaries of the boundaries according to which direction they 
are in. If cr G 9 (^Bi (-^-i)) some z^f, then we know that there exists a ranking profile tt such that 

((T, tt) G de (^Bi We know that a and tt differ in exactly one coordinate, say coordinate j; in this 

case we say that a is on the boundary of Bi (^z^j^lj in direction j, and we write a G dj (^Bi (^-^-1)) ■ (This 
notation should not be confused with that of the edge boundary.) 

We can write the boundary of Bi (^z^jfj as a union of boundaries in the different directions: 
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but note that this is not (necessarily) a disjoint union, as a ranking profile a for which cr G 5 (^Bi {^z°^^^ 

might lie on the boundary in multiple directions. 

In particular, we differentiate between the boundary in direction 1 and the boundary in all other direc- 
tions. To this end we introduce the notation 



With this notation we have the following corollary of Corollary 17.41 
Corollary 7.5. // (gU holds, then either 
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or 
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(48) ^^^-^^(^^ (-'))) ^4n^.-- 

Lemma 7.6. Suppose the ranking profile a is on the boundary of a fiber for b'^''''^"'''^ in direction j ^ 1, i.e. 



ae U Si 



a, 6 



Then there exists a 3-manipulation point a which agrees with a in all coordinates except perhaps coordinate 
1 and some coordinate j 7^ 1; furthermore di is equal to ai or [a : b]ai, except that the position of a third 
alternative c might be shifted arbitrarily, and aj is equal to aj or zaj for some adjacent transposition z G T, 
except the position of b might be shifted arbitrarily. 



Proof. Suppose a;°;_'J (cr) = Since a £ d yBi y^-ijj ^ ^1 (^^-iji know that / (ct) = a, and if 

<t' = [a : 6]-^ a, then / {a') = b. 

Let TT — {iTj, tT-j) denote the ranking profile such that (cr, tt) e de (^Bi (^z'^i^^ . Let tt' [a : tt. Since 

■n ^ Bi (^z°2\^, if {t^) 1 f W)) 7^ (Oi^)- Then, by Lemma [STTl if / (tt) 7^ .f W), then one of tt and tt' is a 
2-manipulation point. 

So let us suppose that / (tt) — f {t:'). If / (tt') ~ a, then one of a' and vr' is a 2-manipulation point by 
Lemma |5.1[ since tt' — zja' for some adjacent transposition z ^ [a : b]. If / (tt) = 6, then similarly one of a 
and TT is a 2-manipulation point. 

Finally let us suppose that / (tt) = c for some c ^ {a, b}. In this case Lemma [5?2] tells us that there exists 
an appropriate 3-manipulation point a. □ 



Corollary 7.7. // gZ]) holds, then 

7£ 



(49) P (cr e A/3) > 



Proof. Lemma l7.6l tells us that for every ranking profile a which is on the boundary of a fiber for ^"''''["■^1 in 
some direction j 7^ 1, there is a 3-manipulation point a "nearby"; the lemma specifies what "nearby" means. 

How many ranking profiles a may give the same ct? At most 2nfc^, which comes from the following: a 
and (7 agree in all coordinates except maybe two, one of which is the first coordinate; there are n — 1 < n 
possibilities for the other coordinate; in the first coordinate, cti is either cti or [a : b] ai (giving 2 possibilities), 
while some alternative c (fc — 2 < fc possibilities) might be shifted arbitrarily (at most k possibilities); in 
the other coordinate j 7^ 1, (jj is equal to aj or zaj for some adjacent transposition z d T (at most k 
possibilities), except 6 might be shifted arbitrarily (fc possibilities). 

So putting this result from Lemma [7.61 together with (|47l) yields (l49l) . □ 



The remaining case we have to deal with is when (|48|) holds. 
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Lemma 7.8. Suppose the ranking profile a is on the boundary of a fiber for B'^^'^'^""''^ in direction 1, i.e. 



Then either a G LDi {a,b), or there exists a 3 -manipulation point a which agrees with a in all coordinates 
except perhaps in coordinate 1; furthermore ai is equal to ui, or [a : b] ui except that the position of a third 
alternative c might be shifted arbitrarily. 



Proof. Just like the proof of Lemma 16.41 □ 

The following corollary then tells us that either we have found many 3-manipulation points, or we have 
many local dictators on three alternatives in coordinate 1. 



Corollary 7.9. Suppose (|48|) holds. Then either 



(50) H I" e LD^"''''"^) ^ P (a e LDi (a, b)) > 

c^{a,b} 

or 

P (<T e Ms) > 



2M2 



16n2A:i4- 



7.2.1 Dealing with local dictators 

So the remaining case we have to deal with in this small fiber case is when (|50p holds, i.e. we have many 
local dictators in coordinate 1. 

Lemma 7.10. Suppose a G Loj;"'''''^'^ for some alternative c ^ {a,b}. Define a' := {a[,a^i) by letting a[ be 
equal to u\ except that the block of a, b and c is moved to the top of the coordinate. Then 

• either a' G LD^'''''''^^ 

• or there exists a 3-manipulation point a which agrees with a in all coordinates except perhaps in 
coordinate 1; furthermore d\ is equal to o\ except that the position of a, b and c might be shifted 
arbitrarily. 

Proof. Just like the proof of Lemma 16.61 □ 
Corollary 7.11. // ((50|) holds, then either 



(51) E IP ('^ ^ LDl''-'^'\{a^ (1) , ai (2) , a, (3)} = {a, b, c}) > 

cf{a,b} 

or 

P (o- e M3) > '^^ 



167l2/cl3 



IGn^k^^ 

Proof. Just like the proof of Corollarv 16.71 □ 
Now ([5T|) is equivalent to 

(52) E ^ e LdI^^''^^ (ai (1) , (2) , ai (3)) = {a, b, c)) > 



96n2A:i3 " 



We know that 

P ((.r (1) , (2) , ar (3)) = (a, 6, c)) = < |, 



and so (|52p implies (recall Definition [2T|) 

(53) E If"!"'"''^ ("^ ^ L/^i"'"^'^) > 

c^{a,b} 



576n2fci 
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Now fix an alternative c ^ {a,b} and define the grapli G(^a,b.c) — (^(a,6.c)7 -E'(a,6.c)) to have vertex set 
ViaAc) ■■= W e ^fc : i^i (1) , ^1 (2) , ^1 (3)) = (a, 6, c)} 

and for cr, tt e b Ist (cr, tt) g -E(a.f),c) if ^-nd only if and tt differ in exactly one coordinate, and by an 
adjacent transposition in this coordinate. So G^a,b,c) is the subgraph of the refined rankings graph induced 
by the vertex set V(^a,b,c)- 
Let 

T,{a,b,c) F(,,b,c) n LZ?^°^'''=^ 

and let (Ti {a,b, c)) and 9(Ti(a, 6,c)) denote the edge and vertex boundary of Ti{a,b,c) in G(^a,b,c)i 
respectively. 

The next lemma shows that unless Ti (a, 6, c) is almost all of V(a,b,c) j the size of the boundary d (Ti (a, 6, c)) 
is comparable to the size of Ti (a, 6, c). The proof uses a canonical path argument, just like in Lemma |7.3[ 
and is very similar to the proof of Lemma 16.81 

Lemma 7.12. Let c ^ {a,b} be arbitrary. Write T = Ti {a,b,c) for simplicity. Iff[°"^''''^ {a ET) < 1- 6, 
then 

(54) P^''^"^ {aed (T)) > J^p[^''''^'> (aeT). 

Proof. The proof is essentially the same as the proof of Lemma 16.81 with a slight modification to deal with 
n coordinates. Let — V(^a.b,c) \ T (a, &, c). For every (a, cr') g T x we define a canonical path from a to 
cr' by applying a path construction in each coordinate one by one, and then concatenating these paths. In 
all coordinates we apply the path construction of 15, Proposition 6.4.], but in the first coordinate we only 
apply it to alternatives [k] \ {a, 6, c}. 

The analysis of this construction is done in exactly the same way as in Lemma 17.31 in the end we get 
that \de {T)\ > -p- |r|. Now every vertex in V(a,fc,c) has no more than nk neighbors, which implies (|54)) . □ 

The next lemma tells us that if cr is on the boundary of a set of local dictators on {a, b, c} for some 
alternative c ^ {a, 6} in coordinate 1, then there is a 4- manipulation point & which is close to a. The proof 
is similar to that of Lemma 16.91 but we have to take care of all n coordinates. 

Lemma 7.13. Suppose a e d (Ti (a,b, c)) for some c ^ {a,,b}. We distinguish two cases, based on the 
number of alternatives. 

If k ~ 3, then there exists a (3-) manipulation point a which differs from a in at most two coordinates, 
one of them being the first coordinate. 

If k > 4:, then there exists a ^-manipulation point a which differs from a in at most two coordinates, 
one of them being the first coordinate; furthermore, cti is equal to ai except that the order of the block of 
a, b and c might be rearranged and an additional alternative d might be shifted arbitrarily; and in the other 
coordinate, call it j , aj is equal to <7j except perhaps a, b and c are shifted arbitrarily. 

Proof. Let tt be the ranking profile such that (cr, tt) G de {Ti (a, b, c)), let j be the coordinate in which they 
differ, and let z be the adjacent transposition in which they differ, i.e. tt — ZjO. Since tt ^ Ti (a, 6, c), there 
exists a reordering of the block of a, 5, and c at the top of tti such that the outcome of / is not the top ranked 
alternative in coordinate 1. Call the resulting vector ttJ^, and let tt' := (tt^^, tt^i). W.l.o.g. let us assume that 
ttJ (1) = a. Let us also define a' :— Zjir' . We distinguish two cases: j = 1 and f 

If j = 1 (in which case we must have k > 5), tt' is a 2-manipulation point, since / (cr') — a. 

li j ^ 1, then there are various cases to consider. If the adjacent transposition z does not move a, then 
either tt' or cr' is a 2-manipulation point. So let us suppose that z = [a : d] for some d ^ a. 

Clearly we must have / (tt') = d, or else tt' or cr' is a 2-manipulation point. Suppose first that d e {6, c}. 
W.l.o.g. suppose that d = b. 

Then take alternative c in coordinate j of both cr' and tt', and bubble it to the block of a and b si- 
multaneously in the two ranking profiles. If along the way the value of the outcome of the SCF / changes 
from a or 6, respectively, then we have a 2-manipulation point by Lemma 15.11 Otherwise, we now have a, 
6, and c adjacent in both coordinates 1 and j. Now rearranging the order of the blocks of a, 6, and c in 
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these two coordinates (which can be done using adjacent transpositions), we either get a 2-manipulation 
point by Lemma I5.1[ or we can define a new SCF on two voters and three alternatives, a, 5, and c. This 
SCF takes on three values and it is also not hard to see that the outcome is not only a function of the 
first coordinate, so by the Gibbard-Satterthwaite theorem we know that this SCF has a manipulation point, 
which is a 3-manipulation point of the original SCF /. 

Now let us look at the case when d ^ {b,c}. In this case we do something similar to what we just did 
in the previous paragraph. In both a' and tt', first bubble up alternative d in coordinate 1 up to the block 
of a, b, and c, and then bubble b and c in coordinate j to the block of a and d. All of this using adjacent 
transpositions. If the value of the outcome of the SCF / changes from a or d, respectively, at any time 
along the way, then we have a 2-manipulation point by Lemma |5. II Otherwise, we now have a, 5, c and d 
adjacent in both coordinates 1 and j, and we can apply the same trick to find a 4-manipulation point, using 
the Gibbard-Satterthwaite theorem. □ 

The next corollary puts together Corollary 17.111 and Lemmas 17.121 and 17.131 

Corollary 7.14. Suppose (|5ip holds. Then if for every c ^ {a, b} we have p^"'''''^^ (cr g Ti (a, 6, c)) < 1— j^qj:, 
then 

^(^^^^^)^ 3456S^- 

Proof. We know that (ISTt implies 

c^{a,b} 

Now then using the assumptions. Lemma 17.121 with S ~ and Lemma 17.131 we have 

FiaeAh)> J2 ^pI"^'"^^^ (a e M4) > ^^i''''^(''^9iT,ia,b,c))) 

c^{a,b} c({a,b} 



> 



2 

^ 600^^^"'"^ (a e T, (a, 6, c)) > g^^g^^^^p^ ■ ° 

cf{a,b} 

So again we are left with one case to deal with: if there exists an alternative c ^ {a, b} such that 
j(a,b,c) ^ 6, c)) > 1 — jKm:. Define a subset of alternatives K C [k] in the following way: 



K 



{a, 6} U |c e [k] \ {a, b} : P^"''''"^ {a G Ti (a, b, cj) > 1 



lOOfc} 



In addition to a and b, K contains those alternatives that whenever they are at the top of coordinate 1 with 
a and b, they form a local dictator with high probability. 
So our assumption now is that \K\ > 3. 

Our next step is to show that unless we have many manipulation points, for any alternative c G K, 
conditioned on c being at the top of the first coordinate, the outcome of / is c with probability close to 1. 

Lemma 7.15. Let c E K. Then either 

(55) Pi^^(/(a) = c)>l-^, 
or 

(56) P(cr e Afz) > — 

Proof. Just like the proof of Lemma 16.111 □ 

We now deal with alternatives that are not in K: either we have many manipulation points, or for any 
alternative d ^ K, the outcome of / is not d with probability close to 1. 
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Lemma 7.16. Let d K. IfV[f{a) = d)>^, then 

P(cr e M4) > TTTTr^TTTT- 

Proof. The proof is very similar to that of Lemma l6.12l we do the same steps in the first coordinate as done 
in the proof of Lemma 16.121 and the fact that we have n coordinates only matters at the very end. 

Let a be such that / (cr) — d. We will keep coordinates 2 through n to be fixed as tT_i throughout the 
proof. By bubbling alternatives d, a, and b in the first coordinate, we can define cr', cr''^''''"-', a'^'^'°''^\ (j{o-,f>,d) ^ 
^{a,d,b) ^ ^(b,a,d) ^ ^^^^ ^{b,d,a) j^^g^ ^j^g proof of Lemma [6. 121 Again, we can show that either 

^(-^^^^)^16(b' 

in which case we are done, or 



> 



1600/c 



(57) P^'-''^ (^„i-,b.<i) g = P (a : d^'''''''^) G Ld['' '''''^^ 

Define G(^aA,d) and T(a,b,d) analogously to G[a,b,c) and T(a,b,c), res pectiv ely. 

Suppose that ((57)) holds. We also know that d ^ K, so Lemma [7. 121 applies, and then Lemma [7.131 shows 
us how to find manipulation points. We can put these arguments together, just like in the proof of Corollary 
I7.14( to show what we need: 

P(a e Mi) > J-P^"'"''^) (a e Ah) > -^P^"'"^ (a ed{T, {a,b,d))) 



2 

> — ^^Pl"'''''^aeTi(a,6,d)) > — □ 

Putting together the results of the previous lemmas, there is only one case to be covered, which is covered 
by the following final lemma. Basically, this lemma says that unless there are enough manipulation points, 
our function is close to a dictator in the first coordinate, on the subset of alternatives K. 

Lemma 7.17. Recall that we assume that D (/, NONMANIP) > e. Furthermore assume that \K\ > 3, for 
every c ^ K we have 

(58) Pi^^(/(fT) = c)>l-^, 



and for every d ^ K we have 



'(/(^) = ^)<i. 



Th 



en 



(59) P(^eM2)>^. 

Proof. Just like the proof of Lemma 17.171 □ 

To conclude the proof in the small fiber case, inspect all the lower bounds for P (cr G M4) obtained in 
Section [721 and recall that 7 = yqtj^^j^tt- 



7.3 Large fiber case 

We now deal with the large fiber case, when (P5|) holds for both boundaries, i.e. when 

p(.eLg(B-M'))>-i, 

and 

F(.eLs.(Bj*^))>^. 

We differentiate between two cases: whether d E {a, 6} or d ^ {a, b}. As mentioned before, the d ^ {a, b} case 
has already been solved by Isaksson, Kindler and Mossel [15]. But moreover, we will see that our method of 
proof works in both cases. 
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7.3.1 Case 1 

Suppose d € {a,b}, in which case w.l.o.g. we may assume that d = a. That is, in the rest of this case we 
may assume that 



(60) 

and 
(61) 



F(.eLg(i.-t-'))>-|, 



aeLg ^2 



-,a,c:\a:c\ 



> 



First, let us look at only the boundary between a and b in direction 1. Let us fix a vector z"^^ which 
gives a large fiber Bi (^z"jfj 



for the boundary ^"'^'['^■''1^ j g. we know that 



(62) 

Our basic goal in the following will be to show that conditional on the ranking profile a being in the fiber 
F (^z'^fj (but not necessarily in F ^z"f ^ ), with high probability the outcome of the vote is top^-^ f,| (""i); or 
else we have a lot of 2-manipulation points or local dictators on three alternatives in coordinate 1. 

Our first step towards this is the following. 

Lemma 7.18. Suppose z^'J* gives a large fiber Bi ^z^f^ for the boundary ^"''''["•''1 , Then 
(63) P^'^ (a e Bi \aeF (z^f)) > 1 - k^. 



Proof. We know that 



'((ai(l),ai (2)) = (a, 6) 



aeF(z«t)) = 



1 



k-r 



and so 



j,(a,f)) 



= (fc - 1) P (<7 ^ Bi (z^f ) , (ai (1) , ai (2)) = (a, &) I a e ^ (z^f )) < (fc - 1) 7 < fc7- 
The next lemma formalizes our goal mentioned above. 
Lemma 7.19. Suppose z"f gives a large fiber Bi for the boundary B^' ^. Then either 

(64) P (/ (a) = top{,,,j (ai) I a e F (z^f ) ) > 1 - 2kj 



or 



(65) 



□ 



(66) 



(o- e LDi (a, b) aeF (z^^f ) ) > 



2fc' 



Proof. The proof of this lemma is essentially the same as that of Lemma I6.15[ there are only two slight 
differences. First, we use Lemma [7. 181 to know that holds. Second, we take a € F (^zt'fj to be uniform, 

and we stay on the fiber F ^z^'J*^ throughout the proof: we modify only the first coordinate throughout the 
proof, in the same way as we did for Lemma 16.151 We omit the details. □ 
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Now this lemma holds for all vectors which give a large fiber Bi ^^^^f^ for the boundary B° 
By (|60p wc know that 



a,6; [a:6] 



cr : Bi [x"^-^ (cr)j is a large fiber j > 



Now if (p5)) holds for at least a third of the vectors that give a large fiber Si (^z'^'[^ , then it follows that 



'{<jeM2)> 



6nk^ 

and we are done. If holds for at least a third of the vectors z^'J" that give a large fiber Bi (^z'^fj , then 
similarly we have 

P(aeLDi ia,b))>-^, 

which means that ([501 also holds, and so we are done by the argument in Section [72II1 

So the remaining case to consider is when ((64)) holds for at least a third of the vectors z^'J* that give a 

large fiber Bi (^z'^fj . 

We can go through this same argument for the boundary between a and c in direction 2 as well, and 
either we are done because 

P(o-eM2)> '^^ 



'{a e LD2 (a,c)) > 



7e 



or for at least a third of the vectors that give a large fiber B2 (z^'l) we have 

P (/ (a) top{,,,} (a2) \aeF (z^^^)) > 1 - 2kj. 
So basically our final case is if 

(67) K'^^^^i^si^ 
and also 

(68) ^(-^^^"■^)^^- 

Notice that being in the set F^'^ only depends on the vector x°-'^ (a) of preferences between a and b, and 
similarly being in the set _F1^''' only depends on the vector x""'"^ (cr) of preferences between a and c. We know 

that I ^a;"''' (cr) , (cr)^ | are independent, and for any given i we know that E (cr) x"'"^ (cr)^ — ^ 

Hence we can apply reverse hypercontractivity (Lemma I2.4p . to get the following result. 

Lemma 7.20. // and (UHl) hold, then also 



3- 



(69) P (cr e Ff 



2 ; - 27n3fc2i ■ 

Proof. See above. □ 

The next and final lemma then concludes that we have lots of manipulation points. 
Lemma 7.21. Suppose (p^ holds. Then 

97 



(70) P (ct e M3) > 
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Proof. First let us define two events: 

h {o- : / M = top{<,_fc} (f7i)| 
h {o- : / {<y) top{^^j (era)} 
Using similar estimates as previously in Lemma 13.31 we have 

P (a e /i n /2 n F^^'' n ^2"^'=) >¥(a e F^-^ n f!^''' 



aih.ae n F2"'"j - P (^a ^ /a, a G Ff n F 

The first term is bounded below via while the other two terms can be bounded using the definition of 
F"''' and F2"''^, respectively: 

P (fj ^ /i, (7 e Ff''' n F2"'") < P (cr ^ Ji, (7 G Ff < P (cr ^ /i ere F^-^^ < 2^7, 

and similarly for the other term. Putting everything together gives us 

p (a G /i n /2 n F^'" n F2-^) > - 4kr 

If (T G /i n /2 n Fi'^ n F^''^, then clearly we must have / {a) = a, and therefore (cr) = 1 and (cr) = 1. 
Now define a' from a by bubbling up b in coordinate 1 to just below a, and bubbling up c in coordinate 2 to 
just below a. Either we encounter a 2-manipulation point along the way, or the outcome is still a: f {a') — a. 
If we encounter a 2-manipulation point along the way for at least half of such ranking profiles, then we are 
done: 



2k'-f 



£^ 27 
54n'^fc2-^ k 



Otherwise, we may assume that 



a G /i n /2 n F^^ n F:^^\ f {a') = a) > ~ 2kj. 



In this case define <t' := [a:b]^(T' and a" := [a:c]2(j'- If / (o'') ^ {a, &} or f {cr") ^ {a, c}, then we 
automatically have that one of a' ^a' ,a" is a 2-manipulation point. If / ((t') = 6 and f {cf") = c, then by 
Lemma 15.21 we know that there exists a 3-manipulation point a which agrees with cr except perhaps a, 6, 
and c could be arbitrarily shifted in the first two coordinates. The final case is when a G {/ (cr') , / ((j")}- 
But we now show that this has small probability, and therefore (f70| follows. 
First let us look at the case of / {a') = a. We have 



<T G /i n /2 n F^^' n F^^', / (a') = a, / {a') = a 



= E 
^ E 



' (a G /i n /2 n F ((1, z^f)) n F^, f {a') = aj {a') = a) 



aehnhn F^'\ f {a') =aj (a') = a 
(a:f{a')=a 



aGF((l,z^f 



aGF((l,z^f 



aGF((l,z^t 



aGF((l,z^f 



and we also know that 



Now we know that ct' G F (^(^-1, C Ff ■ 
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The number of cr's that give the same a' is at most fc^, and so we can conclude that 
P (d e /i n /2 n Ff n F!^-\ f (a') = a, / {a') = a) < 4fc3^, 

and similarly 

P (cr e /i n /2 n n F2'''^ / (a') = a, / (a") = a) < Ak^j, 

which shows that 

1 / \ g-, 

e M3) > ^ - 2fc, ~ Sfc^j > _^ _ -J. □ 

To conclude the proof in this case, recall that we have chosen 7 = Yg^^rpr- 
7.3.2 Case 2 

First, as in the previous case, we can look at simply the boundary between a and b in direction 1, and 
conclude that either there are many manipulation points, or there are many local dictators, or ([57)) holds. 
This holds similarly for the boundary between c and d in direction 2. Finally, just as in Section |3.3.2[ we 
can show that (l67t and ([68]) cannot hold at the same time. We omit the details. 

7.4 Proof of Theorem 17.11 concluded 

Proof of Theorem ]?. 1\ Our starting point is Lemma [531 which directly implies Lemma 17.21 (unless there are 
many 2-manipulation points, in which case we are done). We then consider two cases, as indicated in Section 

rm 

We deal with the small fiber case in Section 17.21 First, Lemmas 17. 3[ 17.61 and 17.81 and Corollaries 17.41 
17.51 17.71 and 17.91 implv that either there are many 3-manipulation points, or there are many local dictators 
on three alternatives in coordinate 1. We then deal with the case of many local dictators in Section [7.2.11 
Lemma FTTOl Corollary [TTll Lemmas m2l [7T3l Corollarv Oil and Lemmas [7T5l [7T6l and iTTTl together 
show that there are many 4-manipulation points if there are many local dictators on three alternatives, and 
the SCF is e-far from the family of nonmanipulable functions. 

We deal with the large fiber case in Section 17.31 Here Lemmas 17.181 17.191 17.201 and 17.211 show that if 
there are not many local dictators on three alternatives, then there are many 3-manipulation points. In the 
case when there are many local dictators, we refer back to Section [7. 2. 11 to conclude the proof. □ 



8 Reduction to distance from truly nonmanipulable SCFs 

In this section we prove Theorem 11.51 which says that if our SCF / is close to NONMANIP, then it is also 
close to NONMANIP, or else we have lots of manipulation points. Consequently, this means that if we can 
prove a quantitative Gibbard-Satterthwaite theorem with distance measured from NONMANIP, then we 
can also prove a quantitative Gibbard-Satterthwaite theorem with distance measured from NONMANIP. In 
particular. Theorem 11.21 is what we get when we combine Theorems 17.11 and 11.51 

Proof of Theorem ] 1.5[ Our assumption means that there exists a SCF g E NONMANIP such that D (/, g) < 
a. We distinguish two cases: either g is a function of one coordinate, or g takes on at most two values. 

Case 1. 5 is a function of one coordinate. In this case we can assume w.l.o.g. that g is a function 
of the first coordinate, i.e. there exists a SCF h : Sk ^ [k] on one coordinate such that for every ranking 
profile cr, we have g (a) = h (cri). 

We know from the quantitative Gibbard-Satterthwaite theorem for one voter that for any /3 either 
D (/i, NONMANIP (1, fc)) < /?, or P (a e M3 (/i)) > j^. 

In the former case, we have that 

D (g, NONMANIP {n, fc)) < D {h, NONMANIP (1, fc)) < /3, 
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and so consequently 

D (/, NONMANIP (n, k)) < a + p. 

In the latter case, we have that 



P(aeM3(ff)) = P(aeM3(/i))> 

and so consequently 



105fcl6' 



P(aGA/3(/))>^^-6nfca, 

since changing the outcome of a SCF at one ranking profile can change the number of 3-manipulation points 
by at most 6nfc. Now choosing j3 = lOOnfc^a^/'^ shows that either or holds. 

Case 2. g is a function which takes on at most two values. W.l.o.g. we may assume that the 
range of g is {a, b} C [A:], i.e. for every ranking profile a d SJI: we have g (a) e {a, b}. 

There is one thing we have to be careful about: even though g takes on at most two values, it is not 
necessarily a Boolean function, since the value of g (cr) does not necessarily depend only on the Boolean 
vector x""'^ (a). 

We now define a function h : ^ {a, 6} that is close in some sense to g and which can be viewed 
as a Boolean function h : {a, &}" — ?> {a, 6} because h{a) depends on a only through x°''^ (cr). (The vector 
rj.a,b j-^-j g {—1, 1}" encodes which of a and b is preferred in each coordinate, and a vector in {a, 6}" can encode 
the same information.) For a given ranking profile cr, let us consider the fiber on which it is on, F [x°''^ (cr)), 
and let us define g\F{x'^'i'{a)) to be the restriction of g to ranking profiles in the fiber F [x""'^ (cr)) . Then define 
(see Definition [22]) 

/i(cr) := Maj {g\F{x'-.^(a))) ■ 

By definition, ft. (cr) depends on a only through x""'^ {a)^ so we may also view /i as a Boolean function 
h : {a,&}" {a, 6}. 

For any given < (5 < 1, we either have D [g, h) < S, in which case D (/, /i) < a + S, or if D (g, h) > S, 
then we show presently that 

(71) P(^gM2(/))> ^-nfca. 

Choosing S ~ Sn'^k^a then shows that either ([4]) holds, or D (/, h) < Qri^k^a. 

Let us now show (j7ip . We use a canonical path argument again, but first we divide the ranking profiles 
according to the fibers with respect to preference between a and b. 

Let us consider an arbitrary fiber F (z"'**) , and divide it into two disjoint sets: into those ranking profiles 
for which the outcome of g and h agree, and those for which these outcomes are different. Le. 

F [z"^^) = (z°'^) U F™'" (z"''') , 

where 

^maj ^^a,fc^ ^ {a G F [z"-^) : g (a) = h {a)] , 
j^min ^^-fi) = {^^F{z'^-''):g{a)^h{a)]. 

By construction, we know that 

Now for every pair of profiles (cr,cr') g F'"'" (z°^^) x 

^maj ^2°^'') define a canonical path from a to a' by 
applying a path construction in each coordinate one by one, and then concatenating these paths. In each 
coordinate we apply the path construction of [15l Proposition 6.6.]: we bubble up everything except a and 
b, and then finally bubble up the last two alternatives as well. 
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For a given edge (tt.tt') e {z°-^) x (z"''') there are at most 2fc-* (f )" possible pairs (cr,CT') G 

F™™ (z'''^) X (z"''') such that the canonical path between a and cr' defined above passes through (tt, tt'). 
(This can be shown just like in the previous lemmas, e.g. Lemma 17.31 ) Consequently we have 



I ^min ^^a.fc^ | | ^inaj ^^a.fc^ | 



> 



2fc4 (I)" - 4fc4 



where the edge boundary de (F™'" (z"''')) is defined via the refined rankings graph restricted to the fiber 
F (z"^''). Summing this over all fibers we have that 

I pmin f ^a,h\ I ^ 

(72) E (^'"" I ^ E ' 4fcV^' > iii (^0" , 

using the fact that D {g, h) > 6. 

Now it is easy to see that if (a, cr') e 9e (i^""" (2''''')) for some z"' , then either a or ct' is a 2-manipulation 
point for g. In the refined rankings graph every vertex (ranking profile) has n{k — 1) < nk neighbors, so 
each 2-manipulation point can be counted at most nk times in the sum on the left hand side of ()72p . showing 
that 

n^^MAg))>J^^^ 

from which (j7ip follows immediately, since changing the outcome of a SCF at one ranking profile can change 
the number of 2-manipulation points by at most nk. 

So either we are done because ^ holds, or D (/, h) < 9n^k^a; suppose the latter case. Our final step is 
to look at ft- as a Boolean function, and use a result on testing monotonicity [12 . 

Denote by D the distance of h when viewed as a Boolean function from the set of monotone Boolean 
functions. Let < e < 1 be arbitrary. Then either D < e, in which case D (ft, NONMANIP) < D < e and 
therefore D (/, NONMANIP) < 9n'^k^a e, or D > e. In the latter case we show that then 

(73) P (cr e M2 (/)) > ^ - 9n^k^a. 

nk 

Choosing e = 5n^k^a then shows that either ^ or (|4]) holds. 

Let us now show (j73p . Let us view ft. as a Boolean function, and denote by p (ft) the fraction of pairs 
of strings, differing on one coordinate, that violate the monotonicity condition. Goldreich, Goldwasser, 
Lehman, Ron, and Samorodnitsky showed in [T^l Theorem 2] that p{h) > ^. 

Now going back to viewing ft as a SCF on k alternatives, this tells us that there are at least |2" pairs 
of fibers, which differ on one coordinate, that violate monotonicity. For each such pair of fibers, whenever 
a and b are adjacent in the coordinate where the two fibers differ, we get a 2-manipulation point. Such a 
2-manipulation point can be counted at most n times in this way (since there are n coordinates where a and 
b can be adjacent). Consequently, we have 

|M,W|>£.2.'-2,.-l,!(|)".i.|(«,", 

i.e. 

P(ae M2(ft)) > ^, 

nk 

from which (j73p follows immediately, since changing the outcome of a SCF at one ranking profile can change 
the number of 2-manipulation points by at most nk. □ 



Proof of Theorem ] l.SH First we argue without specific bounds. Suppose on the contrary that our SCF / 



does not have many 4-manipulation points. Then / is close to NONMANIP by Theorem 1 7. II Consequently, 
by Theorem 11.51 / is close to NONMANIP, which is a contradiction. 
Now we argue with specific bounds. Assume on the contrary that 



>(aeM4(/)) < 



1039^67^166 ■ 
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Then by Theorem Owe have that D (/,NONMANIP) 

< ]^Q6^i:jfc^4 : and consequently by Theorem 11.51 we 
have D (/, NONMANIP) < e, which is a contradiction. □ 

9 Open problems 

We conclude with a few open problems that arise naturally, some of which have already been asked by 
Isaksson, Kindler and Mossel |15j . 

• In Section [1.31 we mentioned that our techniques do not lead to tight bounds. It would be interesting 
to find the correct tight bounds. When discussing tight bounds there are various different ways to 
measure the manipulability of a function: in terms of the probability of having manipulating voters, 
in terms of the expected number of manipulating voters, in terms of the number of manipulative edges 
(either in the refined or non-refined graph), etc. 

• A related question is to find, in some natural subsets of functions, the one that minimizes manipulation. 
For example, among anonymous SCFs, which function minimizes the expected number of manipulating 
voters? For example, for plurality, the probability that a ranking profile is manipulable is Q (l/-\/n), 
and if it is manipulable, then Q (n) voters can manipulate individually, so consequently the expected 
value of the number of voters who can manipulate individually is Q Is it true that for all 
anonymous SCFs, this expectation is {\/n)7 

• What if the distribution over rankings is not i.i.d. uniform? It would be interesting to consider a 
quantitative Gibbard-Satterthwaite theorem, and also the questions asked above, in this setting. 

Acknowledgments 

We thank Ariel Procaccia, Gireeja Ranade and Piyush Srivastava for helpful discussions, and Joe Neeman 
for valuable comments on a draft of the paper. We also thank anonymous referees for useful comments. 

References 

[1] K. Arrow. A Difficulty in the Concept of Social Welfare. Journal of Political Economy, 58:328-346, 
1950. 

[2] K. Arrow. Social Choice and Individual Values. Yale University Press, 2nd edition, 1963. 

[3] J.J. Bartholdi III, C.A. Tovey, and M.A. Trick. The Computational Difficulty of Manipulating an 
Election. Social Choice and Welfare, 6(3):227-241, 1989. 

[4] C. Borell. Positivity improving operators and hypercontractivity. Mathematische Zeitschrift, 180(2):225- 
234, 1982. 

[5] V. Conitzer and T. Sandholm. Nonexistence of Voting Rules That Are Usually Hard to Manipulate. In 
Proceedings of the 21st National Conference on Artificial Intelligence, volume 21, pages 627-634, 2006. 

[6] S. Dobzinski and A.D. Procaccia. Frequent Manipulability of Elections: The Case of Two Voters. In 
Proceedings of the 4th International Workshop on Internet and Network Economics, pages 653-664. 
Springer, 2008. 

[7] C. Dwork, R. Kumar, M. Naor, and D. Sivakumar. Rank Aggregation Methods for the Web. In 

Proceedings of the 10th International Conference on World Wide Web, pages 613-622. ACM, 2001. 

[8] P. Faliszewski and A.D. Procaccia. AI's War on Manipulation: Are We Winning? AI Magazine, 
31(4):53-64, 2010. 

[9] E. Friedgut, G. Kalai, N. Keller, and N. Nisan. A Quantitative Version of the Gibbard-Satterthwaite 
Theorem for Three Alternatives. SIAM J. Comput., 40(3):934-952, 2011. 



45 



[10] E. Friedgut, G. Kalai, and N. Nisan. Elections can be manipulated often. In Proceedings of the 49th 
Annual Symposium on Foundations of Computer Science, pages 243-249. IEEE, 2008. 

[11] A. Gibbard. Manipulation of Voting Schemes: A General Result. Econometrica: Journal of the Econo- 
metric Society, pages 587-601, 1973. 

[12] O. Goldreich, S. Goldwasser, E. Lehman, D. Ron, and A. Samorodnitsky. Testing Monotonicity. Com- 
binatorica, 20(3):301-337, 2000. 

[13] L.H. Harper. Global Methods for Combinatorial Isoperimetric Problems. Cambridge University Press, 
2004. 

[14] J.C. Hernandez. Dead Man on Ballot: An Election Mystery, h ttp : //www . nyt ime s . coiii/20 11/01 730/ 
|nyregion/30dead.htiiil| January 28, 2011 (retrieved online October 24, 2011). 

[15] M. Isaksson, G. Kindler, and E. Mossel. The Geometry of Manipulation: A Quantitative Proof of the 
Gibbard-Satterthwaite Theorem. To appear in Combinatorica, 2011. 

[16] J.S. Kelly. Almost all social choice rules are highly manipulable, but a few aren't. Social Choice and 
Welfare, 10(2):161-175, 1993. 

[17] J.H. Lindsey. Assignment of Numbers to Vertices. American Mathematical Monthly, pages 508-516, 
1964. 

[18] E. Mossel. A quantitative Arrow theorem. To appear in Probability Theory and Related Fields, 2011. 

[19] E. Mossel, R. O'Donnell, O. Regev, J.E. Steif, and B. Sudakov. Non-interactive correlation distil- 
lation, inhomogeneous Markov chains, and the reverse Bonami-Beckner inequality. Israel Journal of 
Mathematics, 154(l):299-336, 2006. 

[20] H. Moulin. The Strategy of Social Choice. North-HoUand, 1983. 

[21] A.D. Procaccia and J.S. Rosenschein. Junta Distributions and the Average-case Complexity of Manip- 
ulating Elections. Journal of Artificial Intelligence Research, 28:157-181, 2007. 

[22] M.A. Satterthwaite. Strategy-proofness and Arrow's Conditions: Existence and Correspondence Theo- 
rems for Voting Procedures and Social Welfare Functions. Journal of Economic Theory, 10(2):187-217, 
1975. 

[23] S. Tawa. Philadelphia Voting Machines Pass Inspection, Even If All Ballot Names 
Don't. |http : / /philadelphia.cbslocal . com/2bll/05/06/philadelphia-voting-machines-pass-| 
inspect ion-even- if -all-ballot-names-dont/. May 6, 2011 (retrieved online October 24, 2011). 

[24] L. Xia and V. Conitzer. A Sufficient Condition for Voting Rules to be Frequently Manipulable. In 
Proceedings of the 9th ACM Conference on Electronic Commerce, pages 99-108. ACM, 2008. 



46 



